WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




B-17 

S/X: 09/989,994 
8325-0011.21 



PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 




(11) International Publication Number: 


WO 99/41371 


C12N 15/11, 9/00, C12Q 1/68, C12N 
15/86, 5/10 


Al 


(43) International Publication Date: 


19 August 1999 (19.08.99) 



(21) International Application Number: PCT/US99/03 1 66 

(22) International Filing Date: 12 February 1999 (12.02.99) 



(30) Priority Data: 

09/023,992 



13 February 1998 (13.02.98) US 



(63) Related by Continuation (CON) or Continiiation-in-Part 
(CIP) to Earlier Application 

US 09/023,992 (CIP) 

Filed on 13 February 1998 (13.02.98) 



f"M \ 1 nn l! AnH ( // 1 ) J ...J C~* j MCI. CTT) ATA DTA 

SCIENCES INCORPORATED [-/US]; 850 Marina Village 
Parkway, Alameda, CA 94501 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): KECK, James, G. 
[US/US]; 617 Harbor Colony Court, Redwood City, CA 
94065 (US). WONG, Justin, G., P. [US/US]; Apartment 3, 
389 Somerset Road, Oakland, CA 9461 1 (US). 

(74) Agent: R A E- VENTER, Barbara; Rae^ Venter Law Group, 
P.C., P.O. Box 60039, Palo Alto, CA 94306 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI t GB, GE, 
GH, GM, HR, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, 
LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, 
MX, NO, N2, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, UA, UG f US, UZ, VN, YU, ZW, ARIPO 
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the Ume umii for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



1 



(54) Title: USE OF RIBOZYMES FOR FUNCTIONATING GENES 



5'-N NNN NNN UGU/C NUN NNN UGU/C -3' 
3-N ACA/GNA NNNACA/G-5' 

C 

A 

A 



A 
A 

G 

C G 
A U 
G C 
G C 
A G 
G U 



U 



(57) Abstract 



ii.in^ inn-" in ; ..: gc , ... k ;. a.... , vi.pfi:- : in -p; : , r .,. : ; ^ Vl , SN 



■ ■■ : —-v.. .,.*Cu ,•: uuuiruiu uk lui.^ucn i.if ^..illA Hut 1C1L JUu. I SjXJUM.. ptieilOtVpe L .' ! ICSpOIIM' 

:na> be associated with normai cellular process, or i: may contriDute to the generation of pathogenesis involved in disease dcveiopme.nl 
Ine compositions fine use in high-throughput screens to assign gene functions When associated wiLh a pathogenic piicnotype, these genes 
or their gene products can constitute therapeutic targets tor treatment of diseases. The complete sequence of the gene containing the target 
nacleir acid need not tn tv* known for the mrrhnd ro ho n^rd Nurrr^MK 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 




Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Fmland 


IT 


Liihuanta 


SK 


Slovakia 


AT 


Austria 


KR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina haso 


GK 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HL 


Hungary 


MI, 


Mali 


TT 


Trinidad and Tobago 


BJ 


Denm 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


11 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United Stales of America 


CA 


Canada 


IT 


Italy 


MX 


Me* ica 


uz 


Uzbekistan 


CK 


Central African Republic 




Japan 


NK 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


( II 




K C 




NO 




7\V 




lU 


\ if rin.il" - 


1 1 


i > . Urn Mr i 


SI) 


Nlkli-U; 






1>K 


1 JcrmaK 


LK 


S n I ;in k a 


St 


Sweden 






h r 




LK 


[.ihen,: 


SG 


Singapore 







WO 99/41371 PCT/US99/03166 



USE OF RIBOZYMES FOR FUNCTIONATING GENES 

INTRODUCTION 

5 Technical Field 

The present invention is related to methods and compositions for identifying a gene or 
genes associated with the generation of a specific cellular phenotype or a specific cellular 
response using combinatorial libraries of catalytic RNA directed against RNA sequences 
encoding structural or functional polypeptide motifs. The invention is exemplified by use of a 
10 combinatorial ribozyme library to target sequences in mRNAs encoding zinc finger, protein 
kinase and integrin motifs. 

Background 

Properly functioning cells are necessary for any organism, including humans, to 
15 thrive; improperly functioning cells may contribute to the development of pathogenic or 
disease states in a given individual, including generation of cancers, autoimmune diseases, 
innate immunodeficiencies, neurologic diseases, and inborn errors of metabolism. In 
addition, even properly functioning cells may contribute to pathogenic states, including 
susceptibility to infectious agents, atopic/allergic pathogeneses, and pathogenic states 
20 associated with allograft transplantation. In both of the above cases, inappropriate 

expression, regulation, or function of a specific gene product or gene products within a cell 
may lead to the improper behavior of that cell within the context of its normal function in an 
organism. Often, the activity of a single gene product, such as a protein or polypeptide, will 
affect the expression, regulation, or function of other gene products within the same cell or 
15 within neighboring cells. Aberrant expression, regulation, or function of these aggregated 
gene products may then result in the development of specific disease phenotypes or 
syndromes. 

Approaches that have been used to identify genes which are potentially involved in a 

cells Ihese approaches have given rise in a rapid increase m the number ol DN A sequences 
associated with various pathologic states. These sequences include not only full length genes. 
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but also cDNA sequences comprised of partial gene sequences or ESTs. Although sequences 
identified by these processes are associated with a pathologic state, it is difficult to ascertain a 
priori whether a given gene is directly involved in the disease development process, or 
whether its expression occurs in a secondary fashion after the pathogenic process has already 
5 begun. 

Involvement of particular genes as causative agents in the disease development process 
can be confirmed by a number of methods. Confirmation of the role of particular genes in 
the disease development process using partial cDNA sequences is more difficult to assess, 

nf th** methods meri rentiire knowledge of the full pene seouence 

10 Thus, while the number of potentially novel genes has expanded exponentially, identification 
of the functions ascribed to most of these genes and gene sequences, as well as their 
prospective roles in disease development has lagged far behind. 

One way to establish the causative effect of a gene or gene sequence in the 
development of a specific cellular phenotype or response is to interfere with the expression or 

15 function of that gene or gene product, and then to determine the resulting effect on that 

cellular phenotype or response. Methods utilized to interfere with gene expression in vivo 
involve gene targeting by homologous recombination in embryonic stem cells, re-implantation 
of the stem cells, gestation of the embryos, and isolation of animals bearing diallellic 
deletions in the gene of interest, so called "transgenic technology". The development of 

20 transgenic technology has been an important advance in the tools available for studying the 
function of genes at the organismal level. Because this procedure can take up to a year to 
complete, however, it is not an efficient process for the high-throughput evaluation of genes 
or gene products as causative agents and as potential therapeutic targets. Methods utilized to 
interfere with gene expression in vitro include gene deletion or inactivation by homologous 

25 recombination or triplex technology, RNA transcript inactivation or cleavage by antisense or 
ribozyme technology, and protein inactivation or down-regulation by antipeptide antibody 
fragments or expression of randomized peptides. A limitation to utilizing systems expressing 
randomized peptides, antisense RNA molecules, or antipeptide antibodies to identify gene 

necessar\ to affect a cellular function or phenotype 

Ribozvmes arc RNA molecules that act as enzvmes and can he engineered to cleave 
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other RNA molecules. Thus, ribozymes perform functions in the cell that are very different 
from ordinary RNA, in that, after binding selectively to their specific mRNA target, they act 
catalytically to cut, or cleave, target RNA molecules at specific sites. If an mRNA target in a 
cell is destroyed, the particular protein for which that mRNA molecule carries information is 
5 not produced. The ribozyme itself is not consumed in this process, and can act catalytically 
to cleave multiple copies of mRNA target molecules. One way to use ribozymes to identify 
the function of novel gene sequences is to introduce a pool of ribozymes with degenerate 
target recognition sites into cells in order to reduce or eliminate the expression of a gene or 
pene nrnduct involved in the generation of a specific cellular phenotype or response. In this 

o r * ' ~ 

10 strategy, ribozymes bearing the appropriate recognition sequences eliminate or reduce 
expression of the target gene, while ribozymes not bearing the appropriate recognition 
sequences do not. Loss of a specific cellular phenotype or response associated with 
elimination or reduction in expression of a target gene indicates involvement of that particular 
gene in the development of that particular phenotype or response. 

15 Of the estimated 100,000 expressed genes in a mammalian cell, approximately one- 

third are likely to be necessary for normal cell respiration, metabolism, or viability. A 
totally degenerate ribozyme library would by necessity include ribozymes directed against 
these "housekeeping genes" as well as against genes involved in disease processes. Cleavage 
of housekeeping RNAs results in compromised cellular viability, so no information can be 

20 gained from a great number of the ribozyme sequences in such a library. This problem 

reduces the efficiency of using totally degenerate ribozyme libraries to identify and assign a 
function to novel genes or gene sequences with respect to a disease development process. 
Another major limitation to this system is the need to synthesize and express a completely 
randomized library of nucleic acids and to screen the library for functional activity. The 

25 minimal targeting or recognition sequence of a ribozyme is generally 12 nucleotides and a 

totally random library would contain 4 12 or approximately 16 million ribozymes. Due to the 
large number of permutations of the ribozyme binding sequences, a specific targeting 
approach is essential. It is therefore of interest to develop a high throughput ribozyme based 

disease or phenotype. 
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Relevant Literature 

An RNA molecule not naturally occurring in nature having enzymatic activity 
independent of any protein is disclosed in USPN 4,987,071 . General rules for the design of 
hammerhead ribozymes that cleave target RNA in trans are described in Haseloff and 
5 Gerlach, (1988) Nature 334:585-591. Miniribozymes are disclosed in Uhlenbeck, (1987) 
Nature 328:596-603. Methods for optimizing cleavage of a target RNA by a ribozyme are 
described in USPN 5,496,698. Reporter gene suppression by engineered hammerhead 
ribozymes in mammalian cells is described in Cameron and Jennings, (1989) Proc. Natl. 

a c~; /T to a \ OiC.niin C^^A1 DiK/^Ti/m^ *»vr«-f»ccir\n fmm a retroviral vwtnr i« Hf»crrtfv*H 

10 in Sullenger and Cech, (1993) Science 262:1566-1569. The expression of hammerhead 
ribozymes operatively linked to a T7 promoter is described in Chowrira et aL % (1994) J. 
Biol. Chem. 269:25856-25864. Co-localizing ribozymes with substrate RNAs to increase 
their efficacy as gene inhibitors is described in Sullenger, (1995) Appl. Biochem. Biotechnol. 
54:57-61. Screening of retroviral cDNA expression libraries is described in Kitamura, et al, 

15 (1995) Proc. Nat. Acad. Sci. (USA) 92:9146. Selection of efficient cleavage sites in target 
RNAs by using a ribozyme expression library is described in Lieber and Strauss, (1996) MoL 
Cell. Biol. 15:540-551 . Approaches for the identification and cloning of differentially 
expressed genes is discussed in Soares, (1997) Curr. Opin. Biotechnol. 8:542-546. The 
development of high-throughput screen is discussed in Jayawickreme and Kost, (1997) Curr. 

20 Opin. Biotechnol. 8:629-634. The high throughput screen for rarely transcribed differentially 
expressed genes is described in von Stein et a/., (1997) Nucleic Acids Res. 25:2598-2602. 
High-throughput genotyping is disclosed in Hall, etaL* (1996) Genome Res 6:781-790. 
Methods for screening transdominant intracellular effector peptides and RNA molecules are 
disclosed in W097/27212 and W097/27213. 

25 

SUMMARY OF THE INVENTION 

Methods and compositions for their use therein, are provided for determining and 
validating a link berween a tareet nucleic acid which includes a nucleotide sequence that 

?v./\niv Mhrar,. :< srn '-nu^o >i ioc :i:cmnei - ■■ ::u- : ir\ vv me :mr;n\ ; :,.:ikii .: i\nvin;i; :-N-i. 
which is complementary to a transcription product of the target nucleic acid and a catalytic 
domain which cleaves a sequence within a transcription product of the target nucleic acid 
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coding for the motif of interest so that expression of the transcription product is disrupted. 
The method includes the steps of designing a combinatorial ribozyme library by analyzing a 
consensus nucleotide sequence encoding a protein motif and synthesizing members of a 
library of sense strands of DNA which, when expressed as RNA constitute the members of a 
5 ribozyme library; annealing the sense strands to antisense strands to form double stranded 
DNAs, introducing the double stranded DNAs, which optionally include a means for 
determining directionality of expression, into expression vectors; contacting a host cell 
culture containing one or more host cells with the expression vector(s) under conditions such 
that the expression vectors transfect or infect the host cells; growing the host cells to express 

10 the ribozyme(s); analyzing the phenotype of, or a suitable detectable marker in, the resultant 
transfected or infected host cells to identify any altered host cell by virtue of an alteration in 
phenotype or marker as compared to unmodified host cells; isolating altered host cells; and 
correlating the phenotype of altered host cells with the identity of the target nucleic acid 
encoding the motif of interest by isolating DNA from the isolated altered host cells and 

15 determining the specific ribozyme sequence contained in the isolated DNA which is 
complementary to sequences in the target nucleic acid so as to assign a function to the 
product coded for by the target nucleic acid. The ribozyme libraries and subject methods can 
be used, for example, for functionating a gene encoding a protein that contains a motif of 
interest, such as a gene involved in apoptosis, drug susceptibility, cell cycle regulation, cell 

20 differentiation or transformation of a host cell. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the general structure of the members of a combinatorial ribozyme 
library annealed to an mRNA encoding the minimal recognition sequence of the reverse 
25 translated zinc finger motif (SEQ ID NO:43), C-X-X-C (X = any amino acid). Upper strand 
(SEQ ID NO:l) is the targeted mRNA with the ribozyme cleavage site indicated. The lower 
stand (SEQ ID NO;2) is a hammerhead ribozyme annealed to the mRNA target. (N = any 
nucleotide^ 



1 i I itT k I t IL ( K. !i ; ,' ll A \ . i f V/ ti' i ' . > ........ 

Figure 3 demonstrates the isolation of cells expressing a selectable marker associated 
with a ribo7vme-exnressing construct from Jurkat T-cell cultures transduced with a library ot 
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ribozymes. The selectable marker is the cell surface molecule Lyt-2 (CD8a). Cells 
expressing the Lyt-2 marker are isolated from the rest of the population using a fluorescence 
activated cell sorter. The X axis depicts marker expression. The Y axis depicts cell number. 
The histogram in Figure 3 A shows the profile of marker expression in transduced cultures. 
5 The histogram in Figure 3B shows the same histogram with an expanded Y axis to reveal the 
marker positive population shifting rightward in the histogram. Marker positive cells isolated 
by flow cytometric cell sorting were grown in culture, and marker expression was re- 
analyzed in the enriched cultures. The histogram in Figure 3C shows results from this re- 
analysis. AH cells in the enriched cultures express the marker, demonstrating the ability to 
10 isolate a stable population of cells expressing a library of pooled ribozymes using this 
method. 

Figure 4 demonstrates the effect of expressing a library of ribozymes on the induction 
of a cellular protein by cells in a culture. Loss of the ability to induce the protein exemplifies 
the loss of a cellular response in ribozyme-expressing cells. The X axis depicts expression of 

15 the induced protein. The Y axis depicts cell number. The histogram in Figure 4A shows the 
profile of induced protein expression in normal cultures (stippled lines) or in cultures 
expressing a library of pooled ribozymes (solid lines). The histogram in Figure 4B shows the 
same histogram with an expanded y-axis to reveal the leftward shifting population of cells, 
corresponding to those cells which have lost the ability to induce the protein. Cells from the 

20 leftward part of the histogram in Figures 3A and B were isolated by flow cytometric cell 
sorting, grown in culture, and induction of the cellular protein was re-analyzed. The 
histogram in Figure 4C demonstrates that the subpopulation of cells which have lost the 
responsive phenotype (represented by the left-hand peak of the histogram) can be enriched 
from cultures expressing several different ribozyme species represented in the original pooled 

25 library. 

BRIEF DESCRIPTION OF PREFERRED EMBODIMENT(S) 

In the present invention, a combinatorial ribozyme library designed for a target 

ribozyme* is intended to mean a synthetic RNA molecule that acts as an enzvme and has 
been engineered to cleave other RNA molecules; after binding selectively to a specific RNA 
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target molecule, it acts catalytically to cut, or cleave, a specific RNA target molecule in a 
region encoding a motif such as a zinc finger, a protein kinase or an integrin. Ribonucleotide 
members of the ribozyme library include a binding region which is complementary to a 
transcription product of the target nucleic acid and a catalytic domain which cleaves a 
5 sequence within a transcription product of the target nucleic acid coding for the motif of 
interest so that expression of the transcription product is disrupted. The binding region 
generally flanks the catalytic domain. The ribozyme library is introduced into a viral vector 
such as a retrovirus vector or a plasmid vector which is then used to infect or transfect a host 
eel! culture that is grown to express the ribozyme library; depending upon the system used, 

10 the vector can be incorporated into the host cell genome or can be episomal. Optionally, the 
DNA of the vector is supercoiled. The host cell culture includes at least one host cell and 
can contain a plurality of host cells. The host cell generally is a mammalian cell but can be a 
lower or higher plant cell, an invertebrate cell or a bacterial cell. The expression of the 
ribozyme in the host cell alters the phenotype of the host cell so that a function for the 

15 product encoded by the target nucleic acid can be assigned based upon the change in 

phenotype. The term "function" is intended to mean a detectable or measurable event. The 
target nucleic acid encodes an expression product that is directly or indirectly involved in a 
measurable function or phenotype in a host cell containing the target nucleic acid. Generally 
the expression product is a protein, including signaling molecules and structural proteins. 

20 The term " motif' is intended to mean a conserved or partially conserved sequence shared by 
a functionally or structurally related class or family of proteins. The term "phenotype" is 
intended to mean a characteristic of a specific cell or cell population and includes physical 
functions such as membrane permeability, physiological functions which include those 
affected directly or indirectly by regulatory effectors, and biochemical and biological 

25 characteristics and functions such as protein synthesis and enzyme activity. The host cell 
exhibiting an altered phenotype is identified using and isolated using any of a variety of 
standard techniques. DNA coding for the ribozyme is identified in the DNA isolated from 
the host cell, conveniently by PCR amplification of the mRNA or genomic DNA coding for 

which not only identities the biologically active ribozyme. hut also the identity ot the target 
nucleic acid. 
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There are several advantages to the subject invention. By targeting the combinatorial 
ribozyme library to conserved or partially conserved motifs associated with known functions 
or properties of proteins or polypeptides containing such motifs, the number of ribozymes 
that need to be constructed and analyzed is significantly reduced (less than about 100,000) in 
comparison to a random library (over 16 million). The ability to eliminate the step of 
amplifying plasmid DNA in bacteria such as E. coli is a major cost saving advantage as well 
as a time saving advantage over existing technologies: removal of the E. coli amplification 
can subtract several labor intensive days from the entire process. Furthermore, the subject 
process lends itself to automation when implemented in a matrix format or a 96-well or 
similar multi-well format. The simultaneous construction, delivery and expression of 
multiple members of a combinatorial ribozyme library and their analysis offers the advantage 
that a large number of ribozymes can be expressed conveniently in host cell cultures, thereby 
enabling the identification of genes and determining the function of genes by a manageable 
high throughput screening process in a relatively short period of time. Furthermore, the 
combinatorial ribozyme library can be constructed with synthetic oligonucleotide DNA which 
offers the additional advantage that directionality is conveniently achieved by incorporating 
unique restriction enzyme sites at both ends of each of the oligonucleotides used to prepare 
the double-stranded DNA coding for these molecules so that double-stranded DNA is ligated 
to the delivery vector in the correct orientation for expression. This overcomes the problem 
that if the same restriction enzyme site, blunt ends or restriction enzyme sites comprising 
compatible cohesive ends are used for the ligation, theoretically about 50% of all the 
constructs would be ligated in the incorrect orientation. Other advantages of the subject 
invention include the capability to regulate the magnitude and timing of nucleic acid 
expression and high throughput delivery. Operatively linking the oligonucleotide DNAs 
encoding the combinatorial ribozyme library to a regulatable promoter provides temporal 
and/or cell type specific control throughout the screening assay. Additionally, the magnitude 
of ribozyme expression can be modulated using promoters that differ in their transcriptional 
activity. 

they arc selective lor a specific target motif sequence and act catalytically. rather than in a 
stoichiometric manner Thus, a single ribozyme molecule can cleave and inactivate up to 
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100 RNA transcripts, while a single antisense or antipeptide molecule will only inactivate one 
RNA transcript or one polypeptide. These properties can be used to identify the role of a 
target genetic sequence and to characterize its cellular function and the function of its 
encoded product. In the disclosed invention, it is not necessary to develop conformational 
5 models of the target nucleic acids to identify regions which are particularly accessible. Such 
models typically are developed using computer-assisted predictions of possible 
thermodynamically stable secondary structures. The need for such computer generated 
models is avoided by creating a combinatorial ribozyme library targeted to nucleic acids 

a: ~ — ;nf a mr» A^^ihnnalK/ cuctain^H /»YnrpQ<:inn nf rihn7vmf* antivifv ran hp 

diCOuuig a ui uii^i i . i luuink/nuu^ , u**kr»«<o V *. — ..f*~~r V w — j - -j - 

10 achieved by utilizing plasmid or viral based expression constructs driven by cellular 

promoters in order to constitutively express high levels of ribozymes directed against the 
target of interest, ensuring sufficient levels of cellular genes are inactivated to cause a 
detectable change in cellular phenotype or response. 

Another advantage to using ribozyme molecules for inactivation of cellular RNA 

15 transcripts is that recognition of an mRNA target by the ribozyme molecule requires the 

complementary base-pairing of only 12-14 nucleotides. Knowledge of the entire sequence of 
the gene of interest is therefore not necessary. This characteristic, together with the 
aforementioned ability of ribozymes to function catalytically makes them useful for 
identifying the roles of genes where oniy partial sequences are known, as well as the roles of 

20 genes where the full length sequence is known. 

By constructing combinatorial ribozyme libraries bearing recognition sites derived 
from DNA or RNA sequences encoding known protein functional motifs, the likelihood that a 
ribozyme in the library will cleave a transcript involved in a "functional" gene is greatly 
increased. An additional advantage to this strategy is that more than one combinatorial 

25 library can be introduced into host cells simultaneously, allowing isolation of genes 

containing combinations of specific motifs, which contributes to specificity of the ribozyme 
for a particular gene. For example, one can isolate with equal ease "genes which are 
transmembrane protein receptors with intracellular tvrosine kinase domains and SH2 regions" 

sequence codine tor a protein motit ot interest Motifs of interest are identified by use ol 
scientific literature; public and/or private databases; and other sources (e.g.. Prosite: 
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http://expasy/hcuge.ch/) that contain information regarding the relatedness of various proteins 
based on amino acid sequence homology. Proteins with one or more shared function or class 
tend to contain similar amino acid patterns or motifs that are common for each class of 
protein. For example, receptor tyrosine kinases, enzymes involved in the transfer of 
5 phosphate to tyrosine residues on protein substrates, often contain the amino acid sequence: 
G-X-H-X-N-[LIVM]-V-N-L-L-G-A-C-T (SEQ ID NO:23) wherein X = any amino acid, and 
[]= containing only one of the amino acids listed within the brackets. Examples of tyrosine 
kinases that contain this sequence are platelet-derived growth factor, macrophage colony 

10 vascular endothelial growth factor (VEGF) receptors Flt-1 and Flk-l/KDR. These molecules 
have been demonstrated to participate in various signal transduction pathways. 

The subject invention is designed to identify molecules, previously known or 
unknown, to have comparable roles in the function of a host cell(s) and to be specifically 
associated with disease states or phenotypes. Other examples of conserved motifs that are 

15 contained in functionally related classes of proteins that are critical for cell function are 

proteases. For instance, caspase-1, known as interieukin-1 beta converting enzyme (ICE), 
represents a family of proteases (caspase-1 to 12) involved in apoptosis which has the 
consensus motifs K-P-K-[LIVMF](4)-Q-A-C-[RQG]-G (SEQ ID NO:24) and H-X(2,4)-[SC]- 
X(4)-[LIVMF](2)-[ST]-H-G (SEQ ID NO:25). For abbreviations, see supra, Caenorhabditis 

20 elegans, ced-3, and Drosophila ICE also contain these motifs. 

Other motifs are shared by proteins that have a common structural relationship. For 
example, the zinc finger motif has been found in a variety of DNA-binding proteins. One 
zinc finger is known as the C3HC4 domain and has the consensus sequence: C-X-H-X- 
[LIVMFY]-C X(2)-C-[LIVMYA] (SEQ ID NO:26). This motif is found in a diverse range 

25 of proteins including the BRCA1 protein that is associated with breast cancer, protein RAG-1 
that is involved in rearrangement of immunoglobulin and T-cell receptor genes and in RO/SS- 
A which is associated with lupus and Sjogren's syndrome. Another example is a portion of 
the inteenn familv that has the conserved sequence: G-X-[GNQ]-Xd ,3VG-X-C-X-C-X(2)-C- 

Mouts lound in protein kinases, integrins. caspases and zinc-finger domains have been 
described The combinatorial ribozyme library, however, can be designed to target the 
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mRNA encoding any protein for which a conserved sequence can be identified. These 
include enzymes such as proteases, structural proteins and signaling molecules. 

Different regions within the same motif can be targeted. In addition, if a family or 
class of proteins contains more than one motif, multiple motifs also can be targeted. The 
5 targeted motifs are not limited to those found in proteins with known mammalian regulatory 
functions but also can be motifs that have only been identified in other organisms such as 
yeast, Drosophila, Caenorhabditis elegans. Therefore, human genes critical to disease 
processes or phenotypes that encode proteins containing motifs similar to those in genes in 
lower eukaryotes can be identified 

10 In general, motifs that are derived from highly conserved sequences, are not desirable 

in making a combinatorial ribozyme library, as the sequence would be present in every 
potential target. By highly conserved is meant that all amino acids found in a contiguous 
sequence of amino acids found in a motif are identical. An optimal situation is where several 
conserved sequence possibilities exist, all of which can contribute to a conserved motif. By 

15 conserved is meant that amino acid sequences in a motif are at least 80 % and more preferred 
at least 90 % identical. This increases the target specificity of the combinatorial ribozyme 
pool. In this case, individual ribozymes contained within the library specifically target the 
production of functionally unique molecules. Ribozymes can be designed to motifs of any 
length. As the length of a motif increases, different ribozymes can be targeted to nucleotides 

20 encoding contiguous conserved or partially conserved amino acid sequences throughout the 
length of the motif. Generally, a combinatorial ribozyme library is designed to target an 
RNA encoding a partially conserved amino acid sequence found in a motif of interest. By 
partially conserved is meant that the amino acid sequences found in a motif are at least 60 % 
identical. 

25 When designing the combinatorial ribozyme library, all combinations of nucleotide 

sequences that give rise to the chosen motif based on codon degeneracy and usage and the 
location of the ribozyme cleavage sites are taken into consideration. The target-binding 
nucleotides of the combinatorial ribozyme library are therefore degenerate This insures that 

to the corresponding antisense strands to term double stranded DNA molecules, When 
transcribed in a host cell culrure, the sense DNA produces RNA which is complementary to 
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an mRNA sequence encoding a motif of interest and contains a catalytic domain designed to 
cleave the mRNA sequence. Each member of a ribozyme library includes two stretches of 
antisense oligonucleotides, each preferably between 5-9 nucleotides (nt) long and optimally 6 
to 8 nucleotides long, to bind to the mRNA, with the sequence forming the catalytic domain 
5 or catalytic core in between. The bases immediately adjacent to either side of the catalytic 
core in the sense strands constitute the ribozyme binding sequence when expressed as RNA 
that is complementary to a mRNA sequence. The mRNA target contains a consensus 
cleavage site for the ribozyme. For hammerhead ribozymes the triplet GUC is best but the 
mttm ^M = onv nnpVnHH^ ako rnn he tarpftteri If the catalvtic domain is derived 

OVl^UWllVV L 1 V-/ 1 ^ V 11 *■ J «™»* — — — - — — o "~ " — — " v " ~ " " — " "" 

10 from a hairpin ribozyme, the triplet GUC is also preferred (Kashani-Sabet and Scanlon, 

(1995) Cancer Gene Therapy 2:213-223; Perriman, etal, (1992) Gene (Amst.) 113:157-163; 
Ruffner, et aL, (1990) Biochemistry 29:10695-10702); Birikh, etal., (1991) Eur. /. 
Biochem. 245:1-16; Perrealt, et aL, (1991) Biochemistry 30:4020-4025). Generally, the 
entire ribozyme-mRNA binding sequence is about 10 to 30 nucleotides in length with 11-17 

15 nucleotides being preferred. The catalytic region generally is about 22 nucleotides in length. 
The catalytic region of miniribozymes is generally about 16 nucleotides in length (Uhlenbeck 
etal. Nature 328:596-603). 

The oligonucleotides for the sense and antisense DNA strands can be simultaneously 
synthesized on solid supports in a matrix format, and simultaneously deprotected and cleaved. 

20 If complementary pairs of oligonucleotides are simultaneously synthesized, deprotected and 
cleaved in a matrix format, they can be simultaneously annealed and ligated to a vector. 
Another method of producing these constructs is to make shorter oligonucleotides with 
shorter complementary regions. Each partially complementary oligonucleotide, each having 
one part out of the two parts of the targeted motif and a restriction enzyme site, is annealed, 

25 extended using a DNA polymerase, and digested with the appropriate restriction enzymes 
prior to ligation. For example, when targeting a zinc-finger motif, Cysteme-X-X-Cysteine, 
where X is any other amino acid, the targeted sequence on the mRNA is 5'- 
NUGYNNNNNNUGY-3' (SEQ ID NO:28) where N is any base and Y is U or C (see 

i> as lollows (SEQ ID NO:30-31 ) i> ~C or I i 
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5 ' GGAA 7TCKCANNNCTG ATG AGTCCGTGAG 

ACTC AGGC ACTCCTGCTTTNYGTN CCTA GG5 ' 

Bold indicates the nucleotides encoding the ribozyme catalytic domain and the underlined 
5 sequences encode the sites that bind to the complementary mRNA encoding the zinc-finger 
motif. After annealing, both strands of the oligonucleotide are extended using a DNA 
polymerase and then restriction enzyme digested, in this example, with EcoR I and BamH I 
(italicized bases in upper and lower strands, respectively). The length of the complementary 
region can be longer or shorter depending on the annealing conditions. Extra bases can be 

10 added at the 5' ends of both strands to improve cutting by the restriction enzymes. 

Alternatively, a single oligonucleotide is annealed to the delivery vector, ligated and the 
complementary strand can be filled in by a DNA polymerase or the complementary strand 
can be filled in before ligation. Three oligonucleotides can be annealed together with the 
delivery vector; intervening gaps are filled in by a DNA polymerase and ligated with a DNA 

15 ligase. 

For annealing the complementary oligonucleotide DNA strands encoding the 
combinatorial ribozyme library, special conditions are not required. For example, both 
strands can be dissolved in water then mixed at about a one to one molar ratio. They can be 
mixed in almost any buffer system, T4 ligase buffer, Exonuclease 3 buffer, Mung Bean 

20 Nuclease buffer. No special heating is required, room temperature is adequate. This mixing 
and annealing of the oligonucleotide strands generally occurs in multiwell microliter dishes 
although other appropriate apparatus also can be used. A means for determining 
directionality of expression can be included in the oligonucleotide DNA. Conveniently the 
means is the use of unique non-blunt end forming restriction enzyme sites at both ends of the 

25 oligonucleotide DNA. so that the two oligonucleotides to be annealed share complementary 
sequences except at the ends where they will be able to bind to a restriction enzyme site. For 
the restriction enzyme sites, any non-blunt end forming restriction enzyme site can be used at 
either end, depending on the sites within the DNA vector into which the oligonucleotide 

ligation by DNA ligase can be used, tor example Aat II. EcoR 1. BamH I. Hind III. Pst I. It 
necessarv. a restriction enzvme sue with a cohesive end can be used with a restriction 
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enzyme site that produces a blunt end. Alternatively, the two oligonucleotides can be 
completely complementary, including the ends and digested with restriction enzymes prior to 
ligation with the delivery vector. In this case, it is preferred that the restriction enzyme sites 
do not occur within the oligonucleotide DNA encoding the ribozyme, otherwise partial 
5 digestions will be required. Restriction enzyme digestion is routinely performed using 

commercially available reagents according to the manufacturer's recommendations and will 
vary according to the restriction enzymes chosen. 

The members of the combinatorial ribozyme library are introduced into any of a 

wi*^ A +ir i.^tnrr HonomHinrr th* flvflilflhilitv of rpQtTirrinn pn7vmp sites intracellular 

KSl ¥ «-> i uvpwi AV* w»» . — j t 

10 location, and transcriptional regulatory elements for delivery and expression of the ribozymes 
in the target host cell. The delivery vector into which the ribozymes are to be ligated is 
digested with the appropriate restriction enzymes, either simultaneously or sequentially, to 
produce the appropriate ends for directional cloning of the oligonucleotides. The 
oligonucleotide DNAs preferably contain compatible ends to facilitate ligation to the vector in 

15 the correct orientation. For synthetic oligonucleotide ligation, the ends compatible with the 
vector can be designed into the oligonucleotides. Alternatively, the compatible ends can be 
formed by restriction enzyme digestion or the ligation of linkers to the oligonucleotides 
containing the appropriate restriction enzyme sites. The vector also can be modified by the 
use of linkers. The restriction enzyme sites are chosen so that transcription of the cloned 

20 oligonucleotides from the vector produces a ribozyme targeted to the mRNA transcribed from 
a target nucleic acid that encodes a motif of interest. 

The vector encoding the ribozyme or ribozyme library also may encode a marker 
protein. The marker protein is used for selection of cells that have been transfected/infected 
with the library-encoding vector. The marker may provide antibiotic resistance. The marker 

25 also may provide for visual selection (for example p-galactosidase or green fluorescent 
protein). The marker can also be a transmembrane protein (for example CD4). 

Once digested, the vector and oligonucleotides can be purified by gel electrophoresis, 
chromatography or phenol /chloroform extraction and ethanol precipitation The optimal 

not contain 5 '-phosphate eroups and. therefore . the phosphate groups on the vector produced 
by restriction enzvme digestion are necessary for oligonucleotide- vector ligation. The 5'- 
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phosphate groups can be added to the oligonucleotides by chemical or enzymatic means 
before or after annealing and the 5 '-phosphate can be removed from the digested vector to 
prevent vector-vector ligation. For ligation, ratios of oligonucleotide DNA to vector DNA 
range from approximately 4:1 to 6: 1 . The ligation reaction is performed using T4 DNA 
5 ligase or any other enzyme that catalyzes double stranded DNA ligation. Reaction times and 
temperature can vary from 5 minutes to 18 hours to from room temperature to 15°C. The 
delivery vector containing the combinatorial ribozyme library optionally is treated to increase 
the supercoiling of the delivery vector DNA, for example using DNA gyrase so as to 

* * I* .1 T"\ 'k T a * a— — _ ~ - ^ — *■ Anil «noU no n nnnlrnntnn stj^ll ^>r" tl-*£L intan^a/^ 

improve UpiaKe Oi UlC Ul\f\ UllU a lCL,ipicm ttti, autu ao a p<u~n.a5iii£ v,cii ui liiw iii^uuwu 

10 target host cell. 

One method for expression of the ribozyme library employs recombinant retroviruses. 
These vectors generally include as operatively linked components, retroviral long terminal 
repeats, packaging sequences and cloning site(s) for insertion of heterologous sequences. 
Other operatively linked components can include a nonretroviral promoter/enhancer and a 

15 selectable marker gene. Examples of retrovirus expression vectors which can be used include 
DC-T5T (Sullenger et al. 1990. Mol. Cell Biol. 10:6512-65230), kat {Blood. 1994 83:43-50), 
BOSC {Proc. Natl. Acad. Sci. (USA) (1993) 90:8392-8396), pBabe (Proc, NatL Acad. Sci. 
(USA) (1995) 92:9146-9150) and RetroXpress™ (Clontech, Palo Alto, CA). 

In some instances, it is desirable to increase expression of the ribozyme library 

20 utilizing other promoters and/or enhancers in place of the promoter and/or enhancers 

provided in the expression vector. These promoters in combination with enhancers can be 
constitutive, tissue specific or regulatable. Any promoter/enhancer system functional in the 
target host cell can be used. (See for example. Molecular Virology pp. 176-177; Hofmann, et 
ai 1996. Proc. Natl. Acad. Sci. (USA) 93:5185-5190; Coffin and Varmus, 1996. 

25 Retroviruses. Cold Spring Harbor Press, NY; Ausubel et al. 1994. Current Protocols in 

Molecular Biology. Greene Publishing Associates, Inc. & Wiley and Sons, Inc.). Examples 
include: CMV immediate-early promoter, SV40 promoter, thymidine kinase promoter, 
metallofhionein promoter and the tetracycline operator f Hoffmann et al . CI 996) Proc Natl 



I I, J N v.. v.. ,li j i| 'ii. I. ! : i , i.., •■ ■ I t L.i V • t V l I V i V I ' » \ . l i , i a, Iv, 'J lull ' i i i NVVvj i 

eukaryouc transcriptional elements and arc Hanked by a retroviral packaging signal and 5 
and 3' LTRs This entire retrovirus construct is functional! v linked to the T7 RNA 
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polymerase promoter and T7 terminator. Also encoded by the vector but not within the 
retroviral construct is a gene functionally linked to a eukaryotic promoter that expresses a T7 
RNA polymerase (T7pol) that contains a nuclear localization signal (T7pol-nls). Following 
transfection of this vector into a retroviral packaging cell, the T7-nls is expressed and 
localized in the nucleus where it transcribes recombinant retroviral genomes that are 
packaged by the retroviral genes expressed by the packaging cell. Because of the high 
transcriptional activity of T7pol-nls, high recombinant retrovirus titers can be achieved. 
Similar vectors, utilizing other DNA-dependent RNA polymerases, such as, SP6 or T3 also 
can be used. 

To package the recombinant retrovirus vectors containing the ribozyme library, cell 
lines are used that provide in trans the gene functions deleted from the recombinant retrovirus 
vector such that the vector is replicated and packaged into virus particles. The genes 
expressed in trans encode viral structural proteins and enzymes for packaging the vector and 
carrying out essential functions required for the vector's expression following infection of the 
target host cell. Packaging cell lines and retrovirus vector combinations that minimize 
homologous recombination between the vector and the genes expressed in trans are preferred 
to avoid the generation of replication competent retrovirus. Packaging systems that provide 
essential gene functions in trans from co-transfected expression vectors can be used, as can 
packaging systems that produce replication competent retroviruses. Following packaging, the 
recombinant retrovirus is used to infect target cells of interest. The envelope proteins 
expressed permit infection of the target cell by the recombinant retrovirus particle. 
Retrovirus packaging cell lines which can be used include BOSC23 (Proc. Natl. Acad. Sci. 
(USA) 90:8392-8396), PT67 (Miller and Miller. 1994. /. Virol. 68:8270-8276, Miller. 1996. 
Proc. Natl. Acad. Sci. (USA) 93:11407-11413), PA317 (Mol. Cell Biol. 6:2895 (1986)), 
PG13. 293 cells transfected with pIK6.1 packaging plasmids (US Patent No. 5,686,279), 
GP + envAM12 (Virology 167:400 (1988), PE502 cells (BioTechniques 7:980-990 (1989)), 
GP + 86 (Markowitz, etal. 1988. J. Virol. 62:1120-1124), y-Cre (Danos and Mulligan. 
1988. Proc. Natl. Acad. Sci. (USA) 85:6460-6464). The preferred titer of recombinant 

! : ; . I : \ : , . ; + i ' ! ' : * s ■ • : ■■ ■ . i [ ? * i \ M [if-.. I u ■ r ,■ i r * ■ ; i ^ j 

In addition to recombinant retrovirus systems, other viral packaging svstems such as 
adenovirus-associaied virus (AAV), adenovirus. Sindbis virus. Semliki Forest virus, Fpstein 
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Ban* virus, herpes simplex virus, HIV, or vaccinia virus can be used. Each of these systems 
has a different host range and can be used to infect cells that are refractory to retrovirus 
expression (i.e., non-dividing cells). In the Sindbis virus system (Invitrogen, San Diego, 
CA), the oligonucleotides to be expressed are ligated into the multiple cloning site of a 
5 Sindbis virus DNA vector, e.g. pSinRep5, operatively linked to a Sindbis subgenomic 

promoter and polyadenylation site; the oligonucleotides replace the Sindbis virus structural 
protein genes. pSinRep5 includes an SP6 RNA polymerase promoter for the in vitro 
synthesis of recombinant Sindbis virus genomes; a packaging signal for recombinant RNA 
packaging; and the Sindbis nonstpjcmral polyprotein gene open reading frame. For the 
10 production of Sindbis virus particles, the recombinant Sindbis vector encoding the 

oligonucleotide DNA is linearized, transcribed into RNA and co-transfected into vertebrate 
(BHK-21, Vero) or invertebrate cells (Drosophila) with RNA transcribed from the helper 
vector, pDH-BB, that encodes the viral structural proteins. Following transfection, the 
recombinant Sindbis genomic RNA acts as a mRNA, is translated into the Sindbis virus 
15 polymerase, and expresses the encoded ribozyme and the structural proteins from the helper 
RNA. Because of Sindbis virus' wide host range, the recombinant Sindbis virus can be 
packaged and used to express the ribozyme library in mammalian, avian, reptilian, insect 
cells (e.g., mosquito and Drosophila cells). See for example, Xong, C. et al. (1989) Science 
243:1188-1191; Huang, H.V. et al. (1993) United States Patent Number 5,217,879; Hahn 
20 C.S. et al. (1992) Proc. Natl. Acad. Sci. (USA) 89:2679-2683; Huang, M. and Sommers, J. 
(1991)7. Virol. 65:5435-5439. 

For ribozyme expression in AAV, the oligonucleotide DNA is cloned into an AAV 
expression vector, such as ALAPSN, that contains a cloning site functionally linked to a 
promoter such as a Moloney leukemia virus promoter and flanked by AAV terminal repeats 
25 and a packaging signal, a means for selection. As an example, ALAPSN comprises a 

neomycin resistance gene functionally linked to SV40 transcription control elements. Similar 
AAV vectors, such as CWRSP and CWRSP.N. with comparable features also can be used. 
As an example, to produce recombinant AAV panicles, 293 cells are infected with 

( 1^89} J Virol 03:3822-3828). As recombinant AAV is produced, the 293 cells undergo 
cyiopathology. becoming spherical and lose their ability to adhere to a tissue culture surface. 
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Following development of maximal cytopathology the supernatant and/or cell lysate is 
harvested and, if necessary, concentrated (Halbert et al. 1997. 7. Virol. 71:5932-5941). 
Other methods for producing recombinant AAV also can be used, for example as described in 
USPN 5,354,678. The combinatorial ribozyme library also can be expressed using 
adenovirus expression systems as described in USPN 5,631,236, USPN 5,670,488, 
W094/28152, WO95/1409I, EP0707071, WO96/05321, WO95/14101, WO95/14102, 
WO97/00326, EP94202322.7. 

For vaccinia virus expression, a replication competent vaccinia virus can be used. 
The oligonucleotides to be expressed are operatively linked to a vaccinia virus promoter, for 
example, PI 1. In a preferred embodiment, vaccinia virus strain MVA is used because it 
expresses recombinant genes but contains a deletion that renders it replication incompetent in 
mammalian cells. Therefore, nucleic acids can be expressed in target host mammalian cells 
without the development of vaccinia virus induced cytopathology. The recombinant vaccinia 
virus strain MVA is produced by infecting chicken embryo fibroblasts (CEF) with vaccinia 
MVA and transfecting the transfer vector, pGOl, into which has been ligated the ribozyme 
and a marker gene (beta galactosidase) functionally linked to a vaccinia promoter, such as 
PI 1, and flanked by the MVA genome sequences that flank the site of the MVA genomic 
deletion. The PI 1-ribozyme/beta-galactosidase construct is inserted into the MVA genome 
by homologous recombination. Recombinant viruses can be identified by in situ staining for 
beta-galactosidase expression with X-gal (Wyatt et ai (1995) Virology 210:202-205). 

The combinatorial ribozyme library also can be expressed from plasmid expression 
vectors that are transfected directly into target host cells, including mammalian cells, 
although an intervening bacterial amplification step can be performed prior to delivery of the 
library to the target host cells. The direct delivery of the plasmid expression vector into the 
target host cells without an intervening bacterial cloning or transformation step is preferred 
because it provides a significant savings in time and expense and increases the number of 
genes and ribozyme libraries that can be studied. Expression plasmids contain cloning sites 
operatively linked to transcriptional regulatory elements functional in the target host cells. 



^anscnpiioiKii regulator) element. pL.l-.P-> t im lirogen. >an i;teL'i'. i p^ \n u 
(Clontech. Palo Alto, CA), pAlter MAX (Promega. Madison, WI). The plasmid preferably 
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10 



contains sequences to provide high-copy episomal replication and selectable markers for 
stable maintenance of the vector in the host cell. The plasmids containing the oligonucleotide 
DNA are transfected directly into the target cell of interest. To increase transfection 
efficiency, preferably the plasmids are supercoiled with a gyrase. The oligonucleotides 
encoding the combinatorial ribozyme library alternatively can be ligated into plasmids and 
functionally linked to the T7, SP6, T3 or a similar RNA polymerase promoter. The plasmid 
expression vectors that can be used include pGEM-3Z and pAlter*-£rl (Promega, Madison, 
WI). The plasmid-oligonucleotide DNA construct is transfected into mammalian cells that 
are infected with a vaccinia strain such as MVA that expresses the appropriate RNA 
polymerase (Wyatt etaL, (1995) Virology 210:202-205). For the example of vaccinia MVA 
T7, the T7pol transcribes the oligonucleotides from the plasmid vector. The vaccinia MVA 
amplifies the plasmid-oligonucleotide DNA construct copy number, resulting in an increased 
intracellular template concentration for T7pol transcomplementation and increased ribozyme 
expression and thus activity. 
15 0t her systems for the expression of nucleic acids functionally linked to T7 RNA 

polymerase or other bacteriophage promoters (SP6 or T3) also can be used. Ribozyme 
expression can be performed with a recombinant retrovirus vector containing the 
oligonucleotides encoding the ribozyme functionally linked to a T7 RNA polymerase 
promoter (T7pro) and T7 terminator. This expression cassette is flanked by 5' and 3' LTRs, 
20 a packaging signal and includes the T7pol gene, that encodes a T7pol that contains a nuclear 
localization signal (T7pol-nls), functionally linked to a eukaryotic promoter. In this system, 
the expressed T7 protein is transported to the nucleus for transcription. Due to the high 
transcriptional activity of T7pol, high intracellular levels of ribozyme can be achieved. 
Optionally, the ribozyme can be fused to second ribozyme that acts intramolecularly to free 
25 the ribozyme targeting the mRNA of interest. 

Transfection of nucleic acid (DNA or RNA) encoding the combinatorial ribozyme 
library into cells is required for either packaging of recombinant vectors into virus panicles 
or direct transfection of plasmids that express the combinatorial ribozyme library into target 



" dl Ultn a Jaic,um phosphate nucleic acid co precipitate Poiycations such as polymeric 
(about 4 10 mierograms/ml). a poiycation that acts by neutralizing the net negative surfac 
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charges on the virus and cells (Stoker. "Retroviral Vectors" In Molecular Virology: A 
Practical Approach, Davison and Elliott, eds., p 187) or DOSPER (Boehringer-Mannheim) 
also can be used to increase the efficiency of transfection of low molecular weight DNA. 
Liposomes are available from a variety of commercial suppliers and include DOTAP™ 
5 (Boehringer-Mannheim), Tfx™-50, Transfectam®, ProFection™ (Promega, Madison WI), and 
LipofectAmin™, Lipofectin®, LipofectAce™ (GibcoBRL, Gaithersburg, MD). In solution, the 
lipids form vesicles that associate with the nucleic acid and facilitate its transfer into cells by 
fusion of the vesicles with cell membranes or by endocytosis. Alternatively, DNA can be 
introduced into cells by electroporation. Each of these systems differ in their transfection 

10 efficiency for a given cell line. If transfection conditions for a given cell line have not been 
established or are unknown, they can be determined empirically (Maniatis supra). The 
number of ribozymes expressed per cell depends on the multiplicity of infection for a virus 
vector or the amount of DNA transfected per cell for a plasmid vector. 

From one to multiple ribozymes in a chosen delivery vector are introduced into the 

15 recipient cell. When a retrovirus vector is used, following preparation of recombinant 
retrovirus in a packaging cell, the recombinant retrovirus is used to infect a host cell 
containing a mRNA transcribed from a target nucleic acid encoding the motif of interest. 
The infected or transfected host cell is grown and the phenotype of the infected or transfected 
host cell is analyzed to determine any alterations in phenotype as compared to an uninfected 

20 or untransfected host cell. Optionally, infected or transfected cells are isolated or selected 

from the population of normal cells. Alterations in cell phenotype are then correlated so as to 
assign a function to a product coded for by the cleaved mRNA target. DNA encoding the 
ribozyme expressed in the host cell can be isolated and sequenced to identify the sequence of 
the target mRNA, the gene from which it is transcribed and the encoded protein. This can be 

25 done, for example, by PCRing the ribozyme encoding sequence for example, from cellular 
DNA, or reverse transcription-PCR of RNA, then sequencing the ribozyme encoding 
sequence from the viral or plasmid expression vector in the target cell. 

The target host cell can be any cell of interest that expresses a disease associated 

; 1 1 ci i v u > pc . » nno/Mnc nnrai\ designed k> cleave me transcription product oi me target 
nucleic acid is constructed and expressed in the target host celhs) that are then assayed for an 
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altered cell phenotype. The altered phenotype can be any phenotype which can be detected, 
for example modified cell growth, DNA synthesis, synthesis of a protein(s), chemical 
responsiveness, apoptosis, morphologic changes, cell viability, replication, differentiation, 
expression of biologically active compounds (e.g., steroids), proliferation, drug 
5 susceptibility, the expression of cell surface molecules such as receptor molecules and 

antigens. Proteins that regulate gene expression in cells also can be identified. For example, 
this can be accomplished by monitoring the expression of a reporter gene expressed from a 
promoter that is active in, for example, tumor cells in the presence of ribozymes targeted to 
an mRNA encoding a motif known to function as a DNA binding protein. Conversely, for 
10 the identification of proteins responsible for the maintenance of a normal cell phenotype, 
ribozymes can be constructed to mRNAs encoding proteins that contain a motif of interest 
and which are expressed in the normal cells which are then assayed for an altered phenotype. 
In either approach, proteins and genes associated with disease pathways or phenotypes can 
be identified. 

15 In order to evaluate an alteration in cell phenotype, any of a variety of methods can be 

used, depending at least in part on the phenotype of interest and the function associated with 
the targeted motif In host cells which amplify and express ribozymes, phenotypic change 
can be monitored directly. For example, if the function of a protein containing the targeted 
motif prevents apoptosis and it is inhibited by one or members of a combinatorial ribozyme 

20 library, the host cell undergoes specific types of morphologic changes, such as nuclear 

condensation and DNA fragmentation, following ribozyme expression. If the targeted motif 
is found in a' protein that is involved in drug susceptibility, this function can be identified by 
monitoring cells for altered resistance or susceptibility to the particulr drug or drugs. 

Combinatorial ribozymes also can be used for functionating cellular and viral motif 

25 containing genes that are involved in virus replication. Combinatorial ribozyme libraries can 
be targeted to mRNAs encoding protein motifs that are postulated to be involved in a virus' 
Iifecycle. If the targeted motif is found in a protein that affected replication, virus titers or 
cytopathic effects may increase or decrease. Various stains can be employed to determine 

.)uc rumination and transformation this can t)c monitored b\ measuring the incorporation ot 
a labeled nucleotide into the cell Antibody-based assays can be employed to detect the 
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presence or absence of a protein of interest such as a cell membrane receptor. Additional 
types of assays known to those of skill in the an can be employed depending on the 
phenotype or cellular property that is being analyzed. 

Phenotypic change also can be monitored, for example, by evaluating ribozyme 
5 activity by comparing the targeted mRNA levels in cells expressing and cells that do not 
express one or more members of the combinatorial ribozyme library. Total cellular or 
cytoplasmic RNA can be purified by a variety of methods (Maniatis supra pp. 7.6-7.29) and 
analyzed by Northern or dot blot (Maniatis supra pp. 7.37-7.57). mRNA can be assayed by 
reverse transcriDtion-PCR employing primers that flank the targeted cleavage site(s). The 

10 absence or decreased production of a PCR product is indirectly indicative of ribozyme 
activity (Baier et aL 1994. Molecular Immunology 31:923-932). 

The methods and compositions of the subject invention can be used to identify the 
function of nucleic acids encoding proteins containing motifs of interest. Motif-directed 
ribozyme libraries can be designed and constructed to target virtually any sequence encoding 

15 a motif for which a conserved or nearly conserved sequence can be identified. Conserved 
sequences have been described for caspases and protein kinases. In addition many other 
classes of enzymes can similarly be targeted. A conserved sequence encoding a zinc-finger 
domain that is found in many proteins has been described. Similarly, a sequence conserved 
in integrins has been described. Therefore, it is possible to target motifs in numerous 

20 proteins including enzymes, adhesion molecules, signaling molecules and structural proteins 
having a variety of physiological functions including enzyme activity, protein synthesis, 
biological factor expression or regulatory effector function, which alter various cellular 
phenotypes or responses including changes in cellular proliferation kinetics, changes in 
cellular viability, resistance to facilitated cell death, resistance to antibiotics, magnetic 

25 separation, directed migration, and preferential adhesion. 

Following the identification of cells that exhibit an altered cellular phenotype in 
response to expression of a ribozyme, the host cell having an altered cellular phenotype is 
isolated or selected for on the basis of expression of an appropriate marker, which can be for 

drue selection, usual selection, or methods based on enzymatic activity 

DNA or RNA is isolated from the host cell by standard molecular biology techniques 
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and can be PCR amplified for sequencing as an initial step towards characterization of the 
corresponding gene, transcription product and protein. For example, this can be done by 
PCR amplifying the ribozyme-coding region of the viral or plasmid vector that delivered the 
ribozyme to the cell. The primer pair used to amplify the ribozyme sequence is derived from 
the vector sequences flanking the ribozyme insert. The PCR product is then sequenced to 
determine the mRNA sequence targeted by the ribozyme(s). Based on this information, the 
entire gene sequence can be determined and the sequence of the encoded protein can be 
deduced. PCR products also can be cloned into vectors for farther analysis, or used as 
probes for identification of target nucleic acids. 

In the following examples, a combinatorial ribozyme library is targeted to a zinc 
finger motif, C-X-X-C. Included in the library, at an equivalent proportion to the other 
components, is a ribozyme targeting green fluorescent protein (GFP). Green fluorescent 
protein (GFP) from the firefly Aequorea victoria emits bright green light upon exposure to 
UV light without the requirement of additional proteins, substrates, or cofactors. EGFP 
encodes a protein that has a single, red shifted spectrum and increased expression relative to 
GFP, and therefore, is easily monitored in living cells by fluorescence microscopy and 
fluorescence-activated cell sorting (FACS). 

To demonstrate that the combinatorial ribozyme library can be used to inactivate 
genes involved in a given phenotype, the library targeting the zinc finger motif and 
containing the EGFP-targeted ribozyme is introduced into CHO-A A8 Tet-Off cells or 293 
Tet-Off cells (Clontech, Palo Alto, CA) that express EGFP. Ribozyme activity is inversely 
proportional to reporter gene signal. Alternatively, the zinc finger motif can be incorporated 
into the EGFP sequence, expressed in either cell type described above, and inactivated by the 
combinatorial ribozyme library. Other reporter genes, for example chloramphenicol 
acetyltransferase (CAT), beta-galactosidase, or alkaline phosphatase, also can be used. 

Kits containing combinatorial anti-motif ribozyme libraries also are provided. The 
containers of kit can contain a combinatorial library directed to motifs either as individual 
members of the library, or as a complete library. Optimally the kit contains vectors including 



recognition of a marker protein and K'k primers roi amplication or me nucleotides 
encoding the ribozvmes. 
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The following examples are offered by way of illustration and not by way of 
limitation. 



EXAMPLES 
Example 1 

Design of Combinatorial Ribozyme Library to a Zinc Finger Motif 
The following example discloses methods to design double stranded DNA 
oligonucleotides that code for a combinatorial hammerhead ribozyme library targeted to the 
zinc finger motif, C-X-X-C (X=any amino acid). Hammerhead ribozymes contain two 
recognition domains that are complementary to the mRNA encoding the motif of interest. 
Each recognition domain is composed of at least 6 nucleotides flanking both ends of the 
catalytic core. The optimum cleavage site of the target mRNA is a U. Based on these 
considerations the minimal target sequence contained in the coding sequence of the motif and 
cleavable by a ribozyme is determined. Also, considered is the known degeneracy of the 
genetic code. Mammalian codon usage may also be considered. Thus, for the zinc finger 
motif, described above with the amino acid sequence. C-X-X-C, the sequence is first reverse 
translated to: 5'- UGY-NNN-NNN-UGY (SEQ ID NO:32) (N=any nucleotide, Y = C or U). 
The amino acid sequence is scanned for amino acids that are preferably found at or near the 
middle of the motif that are coded for by codons that contain a U as a fixed position. In this 
case none are available, therefore, to target a ribozyme library to this sequence requires 
fixing one of the variable residues as a U and extending the recognition sequence arms of the 
ribozyme 5' and 3' from this position to include the less or invariable elements characteristic 
of the sequence. Taking into consideration that the hammerhead ribozyme contains two 
recognition sequences each comprised of 6 nucleotides, the minimal recognition sequence of 
the zinc motif is: 5'- NUGYNt/NNNNUGY (SEQ ID NO:33) wherein Nt/N=cleavage 
recognition site, with cleavage occurring 3' to the underlined nucleotide. The underlined 
nucleotide is not targeted by the ribozyme because it does not hybridize or anneal with the 



lhe numner or nbozymcs target inu the minimum .sequence is calculated h> 
multiplying the number ot nucleotides that may occupy each position ot the binding regions 
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Omitted from this calculation is the nucleotide 5' to the cleavage site (underlined nucleotide) 
because it is not part of the ribozyme binding region. For the above example, the number of 
ribozymes to be made equals: 4x1x1x2x4x1x4x4x4x1x1x2=4096. 

Oligonucleotides encoding the combinatorial ribozyme library to the zinc finger motif 
are made in a 96-well matrix using parallel array technology and annealed to form double 
stranded DNA with unique Hind III and Cla I sites at each of the 5' and 3' ends, 
respectively, for ligation into a retrovirus vector. The general structure of two 
complementary oligonucleotides of the combinatorial library is as follows, with the catalytic 
core of a hammer head ribozyme in bold. The underlined regions are the ribozyme binding 
sequences when expressed as RNA that are complementary to all possible permutations of a 
mRNA sequence encoding the zinc finger motif (SEQ ID NO:34-35). 

5 * AGCTT RCANNN CTGATGAGTCCGTGAGGACGAA ANRCAN AT 3 ' 

3 ' AYGTNNNGACTACTCAGGCACTCCTGCTTTNYGTNTAGC 5 ' 

For annealing, approximately 1.0 microgram of each complementary oligonucleotide 
is dissolved in water and mixed at a one to one molar ratio in a 96-well microtiter plate at 
room temperature. The 5' end (left end) of the double stranded DNA fragment overlaps with 
an Hind III restriction enzyme site. The 3' end of the fragment (right end) overlaps with a 
Cla I site. 



Example 2 

Preparation of a Family of Retrovirus Plasmid Vectors 
The purpose of this experiment is to prepare a retrovirus plasmid vector library 
containing the double stranded oligonucleotide DNA encoding the combinatorial ribozyme 
library to the zinc finger motif described in Example 1, supra. pLNCX, (50 micrograms, 
Clontech. Palo Alto, CA), which contains an extended viral packaging signal, multiple 
cloning site and neomycin resistance gene flanked by the Moloney murine leukemia virus 5' 



' Acl1 Pl 11 mum UC11 K v >(> ueih piaie i ne imrai> ui anneal ojigunuueuiides prepared 
in Example 1. supra, are added individually at 4 to b told excess of the Hind lll/Cia 1 treated 
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pLNCX. The oligonucleotide DNA is ligated into pLNCX by adding a tenth volume of 10X 
T4 DNA ligation buffer and T4 DNA ligase. The final concentration of the ligation buffer 
components and T4 DNA ligase are: 0.05 M Tris-HCl (pH 7.6), 10 mM MeCI, 10 mM 
dithiothreitol (DTT), 50 /xg/ml bovine serum albumin (Fraction V; optional), 1.0 mM ATP, 
0.05 Weiss units of bacteriophage T4 DNA ligase per microliter. The ligation is performed 
for 4-8 hours at 16°C. 

Also prepared, is a retrovirus vector encoding a known, active anti-EGFP ribozyme 
following the identical protocol. 



Example 3 

Transfection of Mammalian Cells Using Retrovirus Plasmid Vectors 
The purpose of this experiment is to package the retrovirus plasmid vector library 
prepared in Example 2, supra, and an identical retrovirus plasmid containing a specific anti- 
EGFP ribozyme in retrovirus particles. Using a calcium phosphate precipitation method 
(Keck, et al. (1990) Cell 61:801-809, Cochran, et aL (1985) Proc. NatL Acad. ScL (USA) 
82:19-23) for transfecting DNA into mammalian cells, 0.1 to 0.5 micrograms of ligated 
plasmid/oligo DNAs from Example 2, supra, is transfected into approximately 1,000 to 
25,000 PT67 cells per well of a 96-weIl plate cultured in 100 ^1 of minimal essential media 
(MEM) supplemented with 10% fetal calf serum (FCS). Four hours later the media is 
replaced with fresh MEM supplemented with 10% FCS and incubated at 37°C for 48 hours. 
The rerrovirus vector contains a neomycin resistance gene; therefore. G418 selection is used 
to obtain a population of cells that stably express the transfected vector and to monitor virus 
titers. Recombinant retrovirus production is monitored by titering aliquots of the transfected 
cell supernatant in a focus forming assay in which cells infected with the recombinant 
retrovirus become resistant to G4I8 (Clontech, Palo Alto, CA). When virus titers > lOVml 
are reached, usually between 2-7 days, the viruses are harvested, and random pools of 
retroviruses containing the combinatorial library and the specific EGFP ribozyme are made. 
These pools are used to infect into target cells. CHO-AA8- Tet-Off cells (Clontech. Palo 



These cells arc counted then added to multi-well dishes seeded with target cells The ratio ot 
retrovirus producing PT67 cells and targets cells can vary from about 1 to 10. As the 
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retrovirus particles emerge from the PT67 ceils, the CHO-EGFP cells are infected. 

Example 4 
Analysis of Ribozyme Activity 
This experiment is designed to demonstrate that a ribozyme in a combinatorial library 
can inactivate its mRNA resulting in an altered phenotype of the target host cell. In this 
experiment, ribozyme activity is inversely proportional to EGFP expression. The ribozyme 
hybridizes to and cleaves the EGFP mRNA, thereby, reducing EGFP protein expression. 

runur.PP f^nntmh Pain Aim PA^ nr** rnlhirerl to near rnnfliienrv or 

approximately 50.00-75,000 cells per well of a 96-welI plate in MEM with 10% FCS. The 
recombinant retroviruses library containing the retrovirus encoding the anti-GFP ribozyme 
from Example 3, supra, are randomly pooled and used to infect the cultures of CHO-EGFP 
cells. The multiplicity of infection (vims particle per cell ratio) is about 5-10 to insure that 
every cell per well is infected with at least one virus particle. Infection of target cells is 
enhanced with polybrene (generally 10 micrograms/ml). Mock infected cells or parental 
CHO cells that do not express EGFP serve as controls. Following infection, the cells are 
incubated for 48 hours at 37°C and then assayed for EGFP expression. 

EGFP expression is assayed by EGFP fluorescence using an incident light at 488 nm 
and measuring the emitted light at 507 nm. The emitted or observed light is detected using 
the appropriate set of filters, corresponding to the wavelength of the incident and emitted 
light with a Wallac- Victor Flurometer or by a Florescence Activated Cell Sorter (FACS). 
Cells with decreased EGFP production were analyzed by PCR to confirm the presence of the 
EGFP-specific ribozyme. 

25 Example 5 

Preparation of Plasmid Vector for Non-Retroviral Transfection 
This example discloses the construction of plasmid vectors that express the 
combinatorial ribozymes The oligonucleotide ribozyme library containing the anti-FGFP 

Epstein Harr virus (EBV)-hased vector that is maintained exirachromosomally in primate and 
canine cell lines. pCEP4 contains the nuclear antigen. EBNA-1, for high-copy cpisomal 



10 



15 



20 
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replication of the plasmid by the EBV origin of replication, oriP< and the hygromycin 
resistance gene for stable maintenance of the vector. In this example, the oligonucleotides 
are designed to contain Hind III and BamH I sites to facilitate ligation into the expression 
vector. The anti-EGFP ribozyme is synthesized and cloned as described for the 
5 combinatorial ribozyme library but also containing Hind III and BamH I sites. The synthesis, 
annealing and ligation procedures are the same as those described for the retrovirus vectors in 
Example 2, supra. 

Example 6 

10 Transfection of Mammalian Cells (Non-Retroviral-Mediated Transfection) 

This experiment demonstrates the delivery of plasmid DNA encoding the EGFP 
ribozyme and the combinatorial ribozyme library to mammalian cells. Random pools of 
pCEP4 DNAs containing the EGFP ribozyme and the combinatorial library are introduced 
into host cells by calcium-phosphate precipitation (Cochran et al. (1985) Proc. Nail. Acad. 

15 ScL (USA) 82: 19-23, Keck et al. (1990) Cell 61:801-809). The pCEP4 DNA was treated 
with gyrase (Mizuuchi et al. (1984) J. Biol. Chem. 259:9199-9201; Bates et al. (1996) 
Biochemistry 35: 1408-1416) to increase the transfection efficiency. Gyrase treatment is 
carried out for 1 hour at 25°C in 35 mM Tris-HCl (pH 7.5), 24 mM KC1, 4 mM MgCl 2 , 1-8 

mM spermidine, 9 microgram/ml tRNA, 5 mM dithiothreitol , 6.5% (w/v) glycerol, 100 
20 microgram/ml bovine serum albumin, 12 nM gyrase, 1 mM ATP (Bates et aL 1996. 

Biochemistry 35:1408-1416). In either case, by increasing the amount of transfected DNA 
from the ligation reaction, more than one ribozyme targeted to a specific sequence is 
transfected per cell. This increases the probability that the target mRNA is inactivated and an 
altered phenotype is produced. 

25 

Example 7 
Analysis of Ribozyme Activity 

M ~ 

This experiment is designed to demonstrate the inactivation of FGFP expression in 
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The ribozyme hybridizes to and cleaves the FGFP mRNA. thereby, reducing EGFP protein 
expression. FGFP is assaved as described above and the presence of the ami FGFP 
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ribozyme is confirmed according to the procedure described in Example 4, supra. 

CHO-EGFP cells (Clontech, Palo Alto, CA) are cultured to near confluency in 96- 
well plates in MEM supplemented with 10% FCS as described in example 4. supra. The 
plasmids for expression of the combinatorial ribozyme library and the anti-EGFP ribozvme 
from Example 6, supra, are randomly pooled and used to transfect CHO-EGFP cells. 
Approximately, 0.1-0.5 micrograms of plasmid DNA are used to transfect approximately 
each well of the CHO-EGFP cells cultured in 96-well microtiter dishes to insure that every 
cell was transfected with at least one plasmid. Mock transfected cells or parental CHO cells 
serve as controls. Following transfection, the CHO-EGFP ™\u ar* inmhatPH tw ar.ii 
hours. EGFP is assayed as described in Example 4, supra. 

Example 8 

Design of a Combinatorial Ribozyme Library to the 
Receptor Protein Kinase Motif 
The following example discloses methods to design double stranded DNA 
oligonucleotides that code for a combinatorial hammerhead ribozyme library targeted to the 
receptor protein kinase motif, G-X-H-X-N-[LIVM]-V-N-L-L-G-A-C-T (X=any amino acid; 
[J=position contains one of the enclosed amino acids). The sequence is first scanned for 
amino acids that are preferably found at or near the middle of the motif that are coded for by 
codons that contain a U as a fixed position. In this particular case the position containing 
Leucine (L). Isoleucine (1), Valine (V) or Methionine (M) is coded for by nucleotides as 
follows: 



L.CUA 


I: AUA 


V:GUA 


cue 


AUC 


GUC 


CUG 


AUU 


GUG 


cm; 




GUU 


UUA 







M:AUG 



Ihu.s. 1- I. \ or M could be coded lor b\ a codon or the sequence: N I ' N iN ^ ; ,n\ 
nucleot.de ). Hence, a V is fixed ,n the second position of this codon and can be used as pan 
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of a ribozyme cleavage site. As described in Example 1, supra, if a fixed U residue can not 
be found in the motif, one can be fixed into a variable position. 

Taking into consideration that the hammerhead ribozyme contains recognition 
sequences comprised of 6 nucleotides and codon degeneracy, the minimal motif target is, X- 
5 N-[LIVM]-V-N (SEQ ID NO:36), which is reverse translated to yield the minimal nucleotide 
target for the combinatorial ribozyme library: 5'-N-A-A-Y-N-£/-N-G-U-N-A-A-Y (SEQ ID 
NO: 37) wherein N=any nucleotide; Y=C or U; Nt/N = ribozyme cleavage site, with 
cleavage occurring 3' to the underlined nucleotide. The underlined nucleotide is not targeted 

w»r +Uc% fiUnTUT^ia Kanouro it Hrw»c nr\t VtirRrisli?** nr annual with thf* hinrlincy c#»mi#»nrpc nf thf* 

10 ribozyme. The number of individual ribozymes necessary to target all possible nucleotide 
combinations that may be translated into the motif is calculated as described in Example 1 : 
4x1x1x2x4x1x1x1x4x1x1x2=256. The general sequence of the expressed combinatorial 
ribozyme library is (SEQ ID NO: 38): 

15 5 - RUUNAC CUGAUGAGUCCGUGAGGACGAA ANRUUN 

(R = G or A). The underlined regions are the ribozyme binding sequence when expressed as 
RNA that are complementary to the mRNA encoding the receptor protein kinase motif and 
the bold region is the catalytic core of the ribozyme. Other ribozyme libraries also can also 
20 be targeted to other regions of this motif using the guidelines described above. 

Example 9 

Design of a Combinatorial Ribozyme Library to the Integrin Motif 
The following example discloses methods to design oligonucleotides that code for a 
25 combinatorial hammerhead ribozyme library targeted to the integrin motif: C-X-[GNQ]- 
X(l,3)-G-X-C-X-C-X(2)-C-X-C (SEQ ID NO:39). Following the guidelines described in 
Examples 1 and 8, the G-X-C-X-C (SEQ ID NO:40) can be easily targeted where the amino 
acid. C (Cvsteine), is encoded bv UGU or UGC and the amino acid. G (Glvcine) is encoded 
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encoding the combinatorial ribozyme are (SEQ ID NO:42) 
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5 - CANNNR CTGATGAGTCCGTGAGGACGAA ANNNNC . 

The underlined regions and bold region are the ribozyme binding sequences and catalytic 
core, respectively. The number of ribozymes necessary to target all possible nucleotide 
5 combination is 32,768. Other ribozyme libraries also can be targeted to other regions of this 
motif using the guidelines described above. 

Example 10 

T.-^lo*^*-! TJIo C vnrACctna o Qf»l wtahl a A/farW*r A ccnpiarpH with q 

1 ■ ! ' ' -" ' ' ■ ' 

10 Ribozyme Expressing Construct 

The following experiment demonstrates the ability to isolate cells expressing a 
selectable marker associated with a ribozyme expressing construct from cell cultures 
transduced with a library of pooled ribozymes. A pool of ribozymes directed against the Ick 
gene was synthesized using oligonucleotides encoding a hammerhead ribozyme catalytic core 

15 flanked by nucleotide sequences complementary to sequences in the Ick mRNA. Specific 
restriction endonuclease sites were also engineered into the oligonucleotides to facilitate 
directional cloning and recovery of ribozyme sequences. Sense and antisense 
oligonucleotides were annealed to form a double stranded DNA which was then ligated in to 
an AAV based plasmid vector using T4 ligase in a manner similar to that described in 

20 Example 2. DNA constructs encoding at least 12 different ribozyme molecules cloned into 

AAV plasmid vectors were transfected into a virus packaging cell line, and recombinant virus 
was harvested from the supernatant in a manner similar to the one outlined in the Preferred 
Embodiments (page 16, lines 23-29). The resulting recombinant virus was then used to infect 
the Jurkat T-cell leukemia line. Infected cultures were harvested and stained with a 

25 fluorescently labelled antibody (Pharmingen, San Diego, CA) directed against the selectable 
(Lyt-2/CD8a). Marker expression on the cells was then analyzed by flow cytometry. This 
analysis revealed a small population of marker bearing cells present in the population (see 
Figure 3A). This population of cells was isolated using fluorescence activated cell sorting 

a i u . ...:\, : . i 1 U..I ' M r. i . ; ! ; 1 1 , i ' , . ....... . ..i , u; ; C i - i U ) i L.i UL :;■ , i ^ i . i j % i ■ villi:. 

method, 100** ot the sorted population was found to express the selectable marker (sec 
Figure 3B). suggesting that the recombinant ribozyme expressing AAV genome had been 
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stably integrated into the host cell genome. These results demonstrate that cells expressing 
ribozyrne constructs can be isolated and separated from those that do not and that these cells 
can be expanded in culture for further analysis of cell phenotype or responsiveness. 

Example 11 

Isolation of a Population of Cells Which Have Lost a Specific Cellular Response 
This experiment demonstrates that a population of cells that have lost a specific 
cellular response can be isolated from a culture of cells expressing a library of pooled 
ribozymes. Jurkat cells expressing the AAV/riho7.yme plasmid selectable marker (Lvt- 
2/CD8a) derived from the experiment described in Example 10 were stimulated through T- 
cell receptor engagement with anti-CD3 plus anti-CD28 (Pharmingen, San Diego, CA) 
(Weiss etaL, (1991) Semin. Immunol., 3:313-324; Abraham etal., (1992) Trends Biochem. 
Sci. y 17: 434-438), for 40 hours, and the induction of CD69, a cell surface activation protein 
whose expression is dependent upon Lck protein kinase function (Goldsmith and Weiss, 
(1987) Proc. Natl Acad. Act. USA, 84:6879-6883; Straus and Weiss, (1992) Cell, 70:585- 
593) was analyzed by flow cytometry using a fiuorescently labeled antibody (Pharmingen, 
San Diego, CA). This analysis showed that a fraction of ribozyrne containing cells had lost 
the ability to induce the activation related protein following cell stimulation (see Figure 4A). 
This population of unreactive cells was isolated using fluorescence activated cell sorting and 
recultured. Upon re-analysis, an enrichment of the unresponsive population was observed. 
These results show that a population of cells with a specific altered phenotype or response 
can be isolated from a culture of cells expressing a library of pooled ribozymes directed 
against a specific gene. In this particular example, unresponsive cells were isolated from the 
rest of population using flow cytometry. 

Example 12 

Recovery of Ribozyrne Sequences from Altered Cells Which 
E xpress a Library of Pooled Ribozvme s 

'esponse. and that onj\ a limited number or ribozyrne species are recovered as compared lo 
he number or species present in the original library ot pooled ribozymes. Jurkat cells 
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derived from the experiment described in Example 1 1 which had been sorted for the lack of 
specific protein induction were lysed in lx Taq PCR buffer/0.45% NP-40/0.45% Tween-20, 
and ribozyme sequences were amplified from cellular DNA in a standard PCR reaction using 
lx Taq PCR buffer/ 1.5 mM MgCl 2 /200 dNTPs/0.2 M M oligonucleotide primers/0. 625U 
5 Taq polymerase (Promega, Madison, WI). Amplification was performed for 30 cycles, using 
a melting temperature of 94°C for 30 seconds per cycle, an annealing temperature of 60°C for 
30 seconds per cycle, and an extension temperature of 72°C for 45 seconds per cycle. A final 
incubation of 72°C for seven minutes followed the final amplification cycle. The primer pair 

10 flanking the ribozyme insert, and had the sequence 5'-ATCCGCGTCCTAGGCACGTGA-3' 
(SEQ ID NO: 44) and S'-GTTACTAGTCCGCGGCTCGAC-S' (SEQ ID NO: 45). PCR 
products containing ribozyme sequences were cloned into pCR2.1-TOPO (Invitrogen, 
Carlsbad, CA). Cloned DNAs were transformed into bacteria and the transformed bacteria 
were plated on LB-agar/amp plates. Colony purified ribozyme clones were then sequenced, 

15 and the identities of the ribozymes associated with the loss of cell responsiveness were 

ascertained. While the original library of pooled ribozyme sequences consisted of 12 or more 
individual ribozyme species, the majority of clones (42/45) sequenced contained ribozymes of 
a single species. Only one other ribozyme species was represented in this particular analysis 
(3/45 clones). These results indicate that isolation of specific ribozyme sequences associated 

20 with the loss of a specific cell phenotype or response is possible, even when these ribozyme 
sequences constitute a minor component of a larger library of pooled ribozymes. Knowledge 
of the specific ribozyme sequences associated with the loss of cellular function can be used to 
clone and/or identify previously known or unknown cellular genes involved in generating a 
specific cellular phenotype or response using standard molecular biologic techniques. 

25 

The above examples describe methods and compositions for construction of a 
combinatorial ribozyme library and its high throughput delivery and intracellular expression 
to determine the function of a product(s) encoded bv a tareet nucleic acid that contains a 

interest, the construction of vectors that express nucleic acids that encode a combinatorial 
ribozyme library; the ligation of the oligonucleotides into retrovirus vectors, other viruses, or 
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plasmid vectors; the packaging of the recombinant vector into virus particles; the expression 
of the encoded library from cells either infected with the virus particles or cells directly 
transfected without a bacterial amplification step with the recombinant plasmid expression 
vectors. The results demonstrate that a combinatorial ribozyme library expressed from either 
recombinant virus or recombinant plasmid expression vectors inactivate a target nucleic acid 
to produce an altered cellular phenotype, and that both the specific ribozyme species and the 
targeted cellular gene associated with that altered cellular phenotype can be identified, so that 
a function can be assigned to the target nucleic acid. 

All publications and patent applications mentioned in this specification are indicative 
of the level of skill of those skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same extent as 
if each individual publication or patent application was specifically and individually indicated 
to be incorporated by reference. 

The invention now having been fully described, it will be apparent to one of ordinary 
skill in the art that many changes and modifications can be made thereto without departing 
from the spirit or scope of the appended claims. 
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WHAT IS CLAIMED IS. 

1 . A method of assigning a function to a target nucleic acid comprising a nucleotide 
sequence encoding a motif of interest, said method comprising the steps of: 

growing a host cell culture comprising one or more host cells that contain one or more 
ribonucleic acid members of a combinatorial ribozyme library to express said one or more 
ribonucleic acid members, wherein each said ribonucleic acid member comprises a binding 
region complementary to a transcription product of said target nucleic acid and a catalytic 
domain which cleaves a sequence within said transcription product of said target nucleic acid 
which codes for said motif of interest so that expression of said transcription product is 
disrupted and an altered host cell is produced; and 

correlating phenotypic changes in said altered host cell, as compared to a control host 
cell, with the identity of said target nucleic acid by isolating DNA from said altered host cell 
and determining the specific ribozyme sequence contained therein which is complementary to 
said target nucleic acid , whereby a function for said target nucleic acid is assigned based 
upon said phenotypic changes in said altered host cell. 

2. The method according to claim 1, wherein said function is a physiological 
function. 



3. The method according to claim 1. wherein said function is enzyme activity. 

4. The method according to claim 1 . wherein said function is protein synthesis 

5. The method according to claim 1, wherein said function is biological factor 
expression. 



6. The method according to claim 1, wherein said function is membrane 
permeability 



unction 
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8. The method according to claim 7, wherein said regulatory effector function affects 
induction of a physiological function. 

9. The method according to claim I, wherein said function is altered directly. 

10. A method for identifying one or more members of a combinatorial ribozyme 
library which alters a phenotype of a host cell, said method comprising the steps of: 

growing a cell culture comprising one or more host cells wherein said host cells 
comprise a transcription product of a target nucleic acid encoding a motif of interest; 

contacting said cell culture with one or more members of a combinatorial ribozyme 
library which bind to said transcription product whereby expression of said transcription 
product is disrupted and said host cells exhibit one or more phenotypic changes, whereby said 
host cells exhibiting one or more phenotypic changes are isolated, whereby said one or more 
members of said combinatorial library are identified. 

1 1 . The method according to claim 1 or claim 10, further comprising the step of; 
utilizing the nucleotide sequence of said one or more members of said library which 

disrupt expression of said transcription product as a probe to identify nucleotide sequences of 
said transcription product and said target nucleic acid, whereby said nucleotide sequences of 
said transcription product and said target nucleic acid are obtained. 

12. The method according to claim i or 10, wherein said host cell culture comprises 
a plurality of mammalian cells, bacterial cells, invertebrate cells or plant cells. 

13. The method according to Claim 1 or 10, wherein said motif is a zinc finger 
motif, a receptor protein kinase motif, or an integrin motif 

14 The method according to Claim 1 or 10, wherein said transcription product is 



l.v A double stranded DNA comprising: 

a sense strand and an antisense strand, wherein said sense strand codes tor a catalytic 
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domain which when expressed as RNA disrupts expression of a mRNA transcribed from a 
target nucleic acid encoding a motif of interest and binding regions flanking said catalytic 
domain for binding said RNA to said mRNA, and wherein a means for determining 
directionality of expression is included in said double stranded DNA. 

16. A vector comprising; 

a double-stranded DNA according to claim 15. 

17. The Vector according tO Claim 16 wherein saiH Hnnhl^-stranHpH nMA fiirthf»r 

comprises regulatory elements for expression. 

18. The double stranded DNA according to claim 15, wherein said means for 
determining directionality of expression comprises a different non blunt-ended restriction 
enzyme site at each end of said double stranded DNA. 

19. The double stranded DNA according to claim 15, wherein said RNA is a 
ribozyme. 

20. A retrovirus expression vector comprising: 

a retrovirus plasmid vector comprising a double stranded DNA according to claim 15. 

21. The retrovirus expression vector according to claim 20, wherein said vector 
comprises supercoiled DNA. 

22. A retrovirus packaging cell line comprising: 

a retrovirus expression vector according to claim 20. 

23. An adeno-associated virus expression vector comprising: 



An adeno-associated virus packaging cell line comprising: 
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an adeno-associated virus piasmid vector according to claim 23 and an adeno- 
associated virus helper piasmid. 

25. A piasmid expression vector comprising: 
a double stranded DNA according to claim 15. 

26. A combinatorial library comprising: 

the double-stranded DNA of claim 15 wherein said binding regions are degenerate. 

27. The combinatorial library according to claim 26 wherein said target nucleic acid 
encodes a protein. 

28. The combinatorial library according to claim 27 wherein said protein is an 
enzvme. 

29. The combinatorial library according to claim 28 wherein said enzyme is a protein 
kinase or a protease. 

30. The combinatorial library according to claim 27 wherein said protein contains a 
zinc-finger domain. 

3 1 . The combinatorial library according to claim 27 wherein said protein contains an 
integnn domain. 

32. The combinatorial library according to claim 27 wherein said protein is a 
signaling molecule or a structural protein. 

33. The double stranded DNA according to claim 15 wherein said motif is a zinc 

w A retrovirus particle comprising. 

a genome encoding an RNA comprising a catalytic domain which cleaves mRNA 



SUBSTITUTE SHEET (RULE 26) 



WO 99/41371 



PCT/US99/03I66 



39 

transcribed from a target nucleic acid encoding a motif of interest, and binding sequences 
flanking said catalytic domain for binding said RNA to said mRNA. 

35. A mammalian cell comprising: 

one or more double stranded DNA(s) comprising a sense strand and an antisense 
strand, wherein said sense strand codes for a catalytic domain which when expressed as RNA 
cleaves a mRNA sequence transcribed from a target nucleic acid encoding a motif of interest, 
and binding regions flanking said catalytic domain for binding said RNA to said mRNA, and 
wherein a means for determining directionality of expression is included in said one or more 
double stranded DNA(s). 

36. An adeno-associated virus comprising: 

a genome encoding of an RNA comprising a catalytic domain for cleavage of a mRNA 
transcribed from a target nucleic acid comprising a sequence encoding a structural motif of 
interest, and binding regions flanking said catalytic domain for binding said RNA to said 
mRNA. 

37. A ribozyme comprising; 

a catalytic domain and binding domains complementary to a nucleotide sequence 
encoding a motif of interest, that disrupts expression of said nucleotide sequence. 

38. The ribozyme according to claim 37, wherein said nucleotide sequence is RNA. 
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Figure 2 

1) 5-CCAGCTC C TGA TGA GTC CGT GAG GAC GAA ACCAGGA-3 ' 

2) 5 '-GGCCGTT C TGA TGA GTC CGT GAG GAC GAA ACGTCGC-3 ' 

3) 5*-CTCGCCG C TGA TGA GTC CGT GAG GAC GAA ACACGCT-3' 

4) 5*-GCAGATG C TGA TGA GTC CGT GAG GAC GAA ACTTCAG-3' 

5) 5' TGGTCAC C TGA TGA GTC CGT GAG GAC GAA AGGGTGG-3' 

6) 5 -AGCGGCT C TGA TGA GTC CGT GAG GAC GAA AAGCACT-3* 

7) 5'-CATGGCG C TGA TGA GTC CGT GAG GAC GAA ACTTGAA-3' 

8) 5 -GCTCCTG C TGA TGA GTC CGT GAG GAC GAA ACGTAGC-3' 

9) 5'-CGTCCTT C TGA TGA GTC CGT GAG GAC GAA AAGAAGA-3 ' 

10) 5 -CGCCCTC C TGA TGA GTC CGT GAG GAC GAA AACTTCA-3' 

1 1 ) 5 -TGCGGTT C TGA TGA GTC CGT GAG GAC GAA ACCAGGG-3 * 

12) 5 -CCTCCTT C TGA TGA GTC CGT GAG GAC GAA AAGTCGA-3 • 

13) 5 M3TAGTTG C TGA TGA GTC CGT GAG GAC GAA ACTCCAG-3' 

14) 5'-TGATATA C TGA TGA GTC CGT GAG GAC GAA ACGTTGT-3' 

15) 5 -GGATCTT C TGA TGA GTC CGT GAG GAC GAA AAGTTCA-3' 

1 6) 5'-GGTCGGC C TGA TGA GTC CGT GAG GAC GAA AGCTGCA-3' 

17) 5'-GCAGCAG C TGA TGA GTC CGT GAG GAC GAA ACGGGGC-3' 

1 8) 5 -CAGGGCG C TGA TGA GTC CGT GAG GAC GAA ACTGGGT-3' 

— j ^w^^rv^ ^ M %*n ioav,i^^v,j GAG G AC GAA AGAGTGa-j 
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[2: INFORMATION FOR SBQ ID NO;2: 

(l) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANEETNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 
RCANNNCUGA UGAGUOOGUG AQGAOSAAAN RCAN 34 
(2) INFORMATION FDR SEQ ID NO:3 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY': linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 




i2': INFORMATION FOR SEQ ID M0:4: 



u) SEQUENCE CHARACTERISTICS: 
(A) LENGTH -. 36 base pairs 
(B; TYPE: nucleic acid 



(C; STRANDEENESS: single 
iD) TOPOLOGY: linear 



:j) MOLECULE TYPE: other nucleic acid 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGIH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(n) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 5 : 
CTOQCCGCIG ATGACTOOCTT GAGGAOGAAA CACULT 3b 
v2) INFORMATION FOR SBQ ID NO: 6 : 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SBQ ID N0:6: 
GCAGATGCIG ATGAGTCCGT GAGGAOGAAA CITCAG 36 
( 2 i INFORMATION FOR SEQ ID NO : 7 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LEK7IH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: other nucleic acid 



<xi'> SEQUENCE DESCRIPTION : SEQ ID NO: 7: 
TOOICACCIG ATCAGTCCXJT GAGGACGAAA Q37K3G 36 



. .. -l. J ... ■ .k : , 1 ..icx l. : ^ 

:B- TYPE: nucleic ac:.:: 

: C . 5 TRANDEEXES 3 : s ing 1 1 

Pi TOPOLOGY: lineal- 
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MOLECULE TYPE: other nucleic acid 



(>u) SEQUENCE DESCRIPTION : SBQ ID NO:8: 
AGCQGCTCTC ATGACTOOGT GAGGAOGAAA GGCACT 36 
(2) INFORMATION FOR SEQ ID NO: 9 : 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDETNESS : single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 9: 
CATQGQ3CTG ATT3AGTCQTT GAGGAOGAAA CTTGAA 36 
(2) INFORMATION FOR SBQ ID NO: 10: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LEN7TH: 36 hase pairs 

(B) TYPE: nucleic acid 

(C) STRAKEEENESS : single 

(D) TOPOIOGY: linear 

in! MOLECULE TYPE: other nucleic acid 



ixi) SEQUENCE DESCRIPTION: SEQ ID N0:1C: 
GCTOCITXTC ATT3ACTXOT GAGGAOGAAA OGTTAGC 36 
(2! INFORMATION FOR SEQ ID NO: 11 : 

(i; SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGACTCOGT GAGGAOGAAA AGAAGA 36 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENCTIVI : 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: other nucleic acid 



(xi) . SEQUENCE DESCRIPTION: SEQ ID N0:12: 
O3COCT0CTG ATGAGTCOGr GAGGAOGAAA ACITCA 36 
(2) INFORMATION FOR SEQ ID NO: 13: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STOANDECNESS: single 

(D) TOPOLOGY: linear 

(li; MOLECULE TYPE: other nucleic acia 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

GTTCTG ATGAGTO07I GAGGAOGAAA OCAGGG 36 
INFORMATICS FOR SEQ ID NO: 14 : 

(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

mi MOLECULE TYPE: other nucleic acid 
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U> SEQUENCE CHARACTERISTICS: 

(A) LENC7IH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDELMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID 170:15 : 
GTAGTTCCTC ATSAGT0CX7T GAGGAOGAAA CKTAG 36 
(2) INFORMATION FOR SEQ ID NO:16: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii} MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 : 

TGATATACTC ATGACTCOGT GAGGACGAAA OJTTGT 36 

(2) INFORMATION FOR SEQ ID NO: 17: 

(l) SEQUENCE CHARACTERISTICS: 
(AJ LEMTTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: other nucleic acid 



(>::) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
V-rjYTCVG ATGAGTCCGT G^QGAOSAAA AGTTTCA 36 



STRAI^TIDNESS . srrai- 
'APOLOGY: lmeai 
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(11) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGTO3GCCIG ATGACTCOGT GAGGACGAAA GCIGCA 36 

(2) INFORMATION FDR SEQ ID IX): 19: 

ii) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
{B) TYPE: nucleic acid 

(C) STRANEEENESS : Single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 19: 

GCAGCAGCTC ATGACT0CX7T GAGGACGAAA C3GGGC 36 

(2) INFORNFVTICM FOR SEQ ID NO:2C: 

(l) SEQUENCE CHARACTERISTICS: 
!A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(n) MOLECULE TYPE : other nucleic acid 



(xi ; SEQUENCE DESCRIPTION: SBQ ID N0:20: 

CAGGGCGCT3 ATGAGICCGT GAGGACGAAA CTGGGT 36 

-:2) INFORMATION FOR SEQ ID M0:2I: 

( : > SEQUENCE CHARACTERISTICS : 
(A; LENGTH : 36 base pairs 
(B; TYPE : nucleic acid 
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CCAGCAGCTC ATGAGTCOGT GAGGAOGAAA CCATGT 
(2) INFORNPVTICN FDR SBQ ID ND:22: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LEMjIH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID KD:22: 
CTATCCOCTO AT3AGTCOGT GAGGAOGAAA GAGTCA 
(2) INFORMATION FOR SEQ ID NO: 23 : 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STKANDECNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID ND:23: 

Gly Xaa His Xaa Asn Xaa Val Asn Leu Leu Gly Ala Cys Thr 

15 10 

(2) INFORMATION FOR SEQ ID ND : 24 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDGdsESS : not relevant 

(D) TOPOLOGY : linear 



til) MOLECULE TYPE: peptide 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : not relevant 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID WD: 25 : 

His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa His Gly 

15 10 

2) INFORMATION FOR SBQ ID NO: 26: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDECMESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

Cys Xaa His Xaa Xaa Cys Xaa Xaa Cys Xaa 

15 10 

) INFORMATION FDR SBQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS : not relevant 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 



ixi' SEQUENCE DESCRIPTION: SEQ ID NO: 27 



: ■ SEQUENCE CHARACTER ITT I OS : 
' A ' LENGTH : I* h-is- onir<- 



WO 99/41371 



10 



PCT/US99/03166 



(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

In) MOLECULE TYPE: other nucleic acid 



(Xl) SEQUENCE DESCRIPTION: SEQ TD NO:28: 
NUGYNMWNN UGY 13 
(7) INFORMATION FOR SEQ ID N0:29: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

lii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ 3D NO:29: 
RCANNNNNRC A 11 
(2) INFORMATION FOR SEQ ID NO: 30: 

ii) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

in! MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 : 
GGAATTCRCA NNN'CTGATOA CTCOGIOAG 29 
;2; INFORMATION FOR SBQ ID NO: 31; 



i : MOLECULE TYPE . c the r nunei c ac i c 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GGATCOTCGY NITiUJIOCT CACOGACTCA 
(2) lOTORMOTON FOR SEQ TD NO. 32 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS ; single 

(D) TOPOLOGY: linear 

di) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION SEQ ID NO: 32 : 
UGYNNNNNNU GY 

(2! INFORMATION FOR SEQ ID NO: 33 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHjIH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : Single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE; other nucleic acid 



ixi; SEQUENCE DESCRIPTION: SBQ ID NO: 33: 
Nt JGYNUNNNN UGY 13 
(2) INFORMATION FOR SBQ ID NO: 34: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 
{C> STRANDEENESS : single 
(D) TOPOLOGY; linear 
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INFORMATION FOR SBQ ID KO:3S: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MXBCULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SBQ ID N3:35: 
CGATiJTGYNT TTOGTCCTCA QOGACTCATC AQJNNIGYA 39 
(2) HJFORmTICN FDR SBQ ID N0:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(n) MDLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: S0Q ID ISO: 36 

Xaa Asn Xaa Val Asn 

1 5 

INFORMATION FOR SBQ ID NO: 3 "7: 

Ui SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

•in) MOLECULE TYPE: other nucleic acid 



sfcuehce eHARAiCrE^isTi:! 
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(B) TYPE: nucleic acid 

(C) STEANDEDNESS : single 

(D) TOPOIOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID ND:38: 

RUUNAOCUGA UGAGUOC3UG AGGAOGAAAN RUUN 

(2) IHFOPMATI ON POP- SEQ TP NQ*39- 

(i) SEQUENCE CHARACTERISTICS : 
{A) t.^tth : 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 39: 

Cvs Xaa Xaa Xaa Gly Xaa Cys Xaa Cys Xaa Xaa Cys Xaa Cys 

l' 5 10 

(2) INFORMATION FOR SEQ ID NO:40: 

(l) SEQUENCE CHARACTERISTICS; 

(A) LENGTTtt: S amino acids 

(B) TYPE: amino acid 

(C) STRANDECMESS : not relevant 
ID) TOPOLOGY : linear 

(11) MOLECULE TYPE: peptide 



(>:iJ SEQUENCE DESCRIPTION: SBQ ID NO: 40: 

CAy Xaa Cys Xaa Cys 

1 D 



. .. : j * : - - • : . 

r TxTE: . aci-:: 

STPA^a-ESL . Einq^-. 

r : t~pcl x Imeai' 
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(11) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID M0:41: 
Q^NNNUGYNN NUG 

(2) INFORMATION FOR SEQ ID NQ:42: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGIH: 34 base pairs 
(R) TYPE? nurleic acid 

(C) STTONDECNESS : single 

(D) TOPOLOGY : linear 

(n)' MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE ASCRIPTION: SB2 ID NO: 42: 
CANNNRCT3A T3AGTPC0GIG AGGACGAAAN NNKC 34 
(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEI2JESS : not relevant 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
Cys Xaa Xaa Cys 

(2) INPORmTION FOR SEQ ID NO: 44: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(r\ TYPE: nucleic acid 
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(xa) SEQUENCE DESCRIPTION: SEQ ID WO: 44 : 
ATCCGCCTCC TAQGCACCT3 A 21 
(2) INFORMATION FOR SBC ID NO:45: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LEN7IH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(DJ TOPOXjOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GTTACTAGTC 03CGGCPCGA C 



21 



INTERNATIONAL SEARCH REPORT 



I * nation** Application No 

PCT/US 99/03166 



A. CLASSIFICATION OF SUBJECT MATTER ,*,^,/™ M « llc /o£ riOWC/in 

IPC 6 C12N15/11 C12N9/00 C12Q1/68 C12N15/86 C12N5/10 



Accordi ng to international Patent Classification (IPC) or to both national class rf cation and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classrficatwn symbols) 

IPC 6 C12Q G01N C12N 

Documentation searched other than minimum documentation to the extent that sucn documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 


Citation ot document, with indication, where appropriate, of tne relevant passages 


Relevant to clam No 


X 


DE 44 24 762 C (MAX-PLANCK-GESELLSCHAFT) 


15-19, 




27 July 1995 (1995-07-27) 


25-27, 




35,37,38 


Y 


the whole document 


1-38 


Y 


WO 97 27212 A (RIGEL PHARMACEUTICALS INC) 


1-38 




31 July 1997 (1997-07-31) 






cited in the application 






page 3, 1 ine 6 - 1 ine 27 






page 5, line 1 - line 10 






page 21, line 25 - page 22, line 2 






page 24 , line 7 - page 27, line 3 






page 31 - page 34, line 7 




X 


WO 94 13833 A (INNOVIR LAB INC) 


37,38 




23 June 1994 (1994-06-23) 






abstract; claims 





f 




further documents are listed in the continuation ot box C 



0 Special categones ol cited documents 

"A" aocumeni defining the general state of the art whicn is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
tiling aate 

"L" document wnich may tnrow doubts cn priority clatmoi c 
wnicn ts cited to estabiisn the publication dale of ano:rnv 
citation or otner special reason (as specified) 

"C" document reternng lo an oral disclosure, use, exhfcntion or 
other means 




Patent family memoers are listed m annex 



T" later document published after the international tiling date 
or pnonfy date and not tn conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance, the claimed invention 
cannot be considered novel or cannot be considered to 
nvoive an inventive step wnen me document is taken aione 

Y document of particular relevance, the claimed invention 

cannot be considered to involve an invenlive step wnen the 
document is combined wrth one or more other sucn docu- 
^nonts, such comrjination betrq obvious to a cer^on skilled 



uuiy 1999 



c 7/0 7/ 1999 



Nano ;ir.d -nsMnq address of tne tSA 

Ejropean Paten: Office. P B 58 1 Q Patentiaar 
Nl ??8^ HVRi sw"V 



Auihonzea officer 



. t rl r p 



INTERNATIONAL SEARCH REPORT 


1. .national Application No 

PCT/US 99/03166 


C.(Conttnuailon> DOCUMENTS CONSIDERED TO BE RELEVANT 


Category J 


Citation ot document, with mdtcation.whftre appropriate of tne relevant oassages 


Relevant to ciatrn No 


A 


XIE Y F ET AL: "A ribozyme -mediated, 
gene ''knockdown'' strategy for the 
identification of gene function in 
zebrafish" 

PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF THE UNITED STATES OF AMERICA, 
(9 DEC 1997) VOL. 94, NO. 25, PP. 
13777-13781. , XP002108912 
page 13780, right-hand column, last 
paragraph 




1-38 


A 


WO 96 38553 A (MOURITSEN & ELSNER A S 
; JENSEN MARTIN ROLAND (DK); PEDERSEN FINN 
S) 5 December 1996 (1996-12-05) 
page 4, 1 ine 10 - line 24 
page 5, line 19 - page 8, line 23 
claims 




1-38 


A 


STONE B AND WHARTON W: "Targeted RNA 
fingerprinting: The cloning of 
differentially -expressed cDNA fragments 
enriched for members of the zinc finger 
gene family" 
NUCLEIC ACIDS RESEARCH, 
vol. 22, no. 13, 1994, pages 2612-2618, 
XP002090069 
ISSN: 0305-1048 
the whole document 




13, 

28-30,33 


A 


JOHNSON . S. ET AL.: "Identification of 
zinc finger mRNAs using domain-specific 
different i al di spl ay" 
ANALYTICAL BIOCHEMISTRY. , 
vol. 236, 1996, pages 348-352, XP002108913 
ISSN: 0003-2697 
the whole document 




13,30,33 


P.X 

1 
1 


W0 98 32880 A (BARBER JACK ; IMMUSOL INC 
(US); TRITZ RICHARD (US); WELCH PETER (US) 
30 July 1998 (1998-07-30) 

page 4, 1 ine 13 - 1 ine 27 

page 21 - page 34 

page 52, paragraph G. - page 68 

page 76, paragraph E. - page 81 

claims 




1-3,5, 

9-12, 

15-17, 

19,20, 

22-27, 

34-38 



I 

i 



r a c p 2 of 



INTERNATIONAL SEARCH REPORT 



i. national Application No 

PCT/US 99/03166 



C ^Continuation! OOCUMENTS CONSIDERED TO BE RELEVANT 



Category - ; Citation oi document, with indication, wnere aoproonate, of trie relevant passages 



WO 98 50530 A (BELLON LAURENT ;KARPEISKY 
ALEXANDER (US); MCSWIGGEN JAMES A (US);) 
12 November 1998 (1998-11-12) 



claims 1-33,70 
examples 6,7 



Relevant to aaim No 



1,2,5, 

7-12, 

14-19, 

25-27, 

35,37,38 



cage 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



li .lational Application No 

PCT/US 99/03166 



1 

Patent document 


1 

| Publication 




Patent family 




Publication 


cited in search report 


' date 




memoerts) 




date 


DE 4424762 C 


27-07-1995 


WO 


9601314 


A 


18-01-1996 






EP 


0776363 


A 


04-06-1997 


WO 9727212 A 


31-07-1997 


AU 


1707497 


A 


20-08-1997 






AU 


1707897 


A 


20-08-1997 






CA 


2244222 


A 


31-07-1997 






EP 


0877752 


A 


18-11-1998 






wo 


9727213 


A 


31-07-1997 


WO 9413833 A 


23-06-1994 


AU 


675482 


B 


06-02-1997 






AU 


5739694 


A 


04-07-1994 






EP 


0681613 


A 


15-11-1995 






JP 


8507202 


T 


06-08-1996 






US 


5589332 


A 


31-12-1996 



WO 


9638553 


A 


05-12-1996 


AU 


699568 


B 


10-12-1998 










AU 


5893396 


A 


18-12-1996 










CA 


2222879 


A 


05-12-1996 










EP 


0832207 


A 


01-04-1998 


WO 


9832880 


A 


30-07-1998 


AU 


5927598 


A 


18-08-1998 


wo 


9850530 


A 


12-11-1998 


AU 


7290598 


A 


27-11-1998 



A. , it'."'" '■■■(■ , ,4' f ■ . Jl , 



